Abstract

We consider an interacting particle Markov process for Darwinian evolution
in an asexual population with non-constant population size, involving a
linear birth rate, a density-dependent logistic death rate, and a probability $\mu$
of mutation at each birth event. We introduce a renormalization parameter
$K$ scaling the size of the population, which leads, when $K \to +\infty$, to a
deterministic dynamics for the density of individuals holding a given trait. By
combining in a non-standard way the limits of large population ($K \to +\infty$)
and of small mutations ($\mu \to 0$), we prove that a time scale separation
between the birth and death events and the mutation events occurs and that
the interacting particle microscopic process converges for finite dimensional
distributions to the biological model of evolution known as the “monomorphic
trait substitution sequence” model of adaptive dynamics, which describes the
Darwinian evolution in an asexual population as a Markov jump process in
the trait space.

Keywords: measure-valued process; interacting particle system; mutation-selection
processes; Darwinian evolution; trait substitution sequence; adaptive dynamics;
finite dimensional distributions convergence; time scale separation; stochastic
domination; branching processes; large deviations.

AMS subject classification: 60F99; 60K35; 92D15. 


1 Introduction and main results 

We will study in this article the link between two biological models of Darwinian
evolution in an asexual population. The first one is a system of interacting particles
modeling evolution at the individual level, referred to below as the microscopic model,
which has already been proposed and studied in Bolker and Pacala (1997, 1999),
Dieckmann and Law (2000), Law et al. (2003) and Fournier and Méléard (2004)
either as a model of Darwinian evolution or as a model of dispersal in a spatially
structured population. This model involves a finite population with non-constant
population size, in which each individual’s birth and death events are described.
Each individual’s ability to survive and reproduce is characterized by a finite number
of phenotypic traits (e.g. body size, rate of food intake, age at maturity), or
simply traits. The birth rate of an individual depends on its phenotype, and its
death rate depends on the distribution of phenotypes in the population and involves
a competition kernel of logistic type. A mutation may occur at each birth event.

The second model describes the evolution at the population level as a jump
Markov process in the space of phenotypic traits characterizing individuals. It
is called “trait substitution sequence” (Metz et al., 1996), and is referred to below as
the TSS model. In this model, the population is monomorphic at each time (i.e.
composed of individuals holding the same trait value), and the evolution proceeds
by a sequence of appearances of new mutant traits, which invade the population
and replace, after a short competition, the previous dominant trait. The TSS
model belongs to the recent biological theory of evolution called adaptive dynamics
(Hofbauer and Sigmund, 1990; Marrow et al., 1992; Metz et al., 1992), and has
been introduced by Metz et al. (1996) and Dieckmann and Law (1996) and
mathematically studied in Champagnat et al. (2001). The theory of adaptive dynamics
investigates the effects of the ecological aspects of population dynamics on the
evolutionary process, and thus describes the population on the phenotypic level,
instead of the genotypic level. The TSS model is one of the fundamental models of
this theory. It has proved to be a powerful tool for understanding various evolutionary
phenomena, such as polymorphism (stable coexistence of different traits, cf. Metz
et al., 1996) or evolutionary branching (evolution of a monomorphic population to
a polymorphic one that may lead to speciation, Dieckmann and Doebeli, 1999) and
is the basis of other biological models, such as the “canonical equation of adaptive
dynamics” (Dieckmann and Law, 1996; Champagnat et al., 2001).

The heuristics leading to the TSS model (cf. Metz et al., 1996 and Dieckmann 
and Law, 1996) are based on the biological assumptions of large population and 
rare mutations, and on another assumption stating that no two different types of 
individuals can coexist on a long time scale: the competition eliminates one of 
them. In spite of this heuristic, this model still lacks a firm mathematical basis. 

We propose to prove in this article a convergence result of the microscopic model
to the TSS model when the parameters are normalized in a non-standard way,
leading to a time scale separation. Our limit combines a large population asymptotic
with a rare mutations asymptotic. It will appear that this convergence holds only
for finite dimensional distributions, and not for the Skorohod topology, for reasons
that are linked to the time scale separation. For these reasons, and because we
have to combine two limits simultaneously (large population and rare mutations),
this result is different from classical time scale separation results (averaging
principle, cf. Freidlin and Wentzell, 1984). The proof requires original methods, based
on comparison, convergence and large deviation results on branching processes and
logistic Markov birth and death processes. Our convergence result provides a
mathematical justification of the TSS model and of the biological heuristic on which it
is based, and gives precise conditions on the scalings of the biological parameters
in the microscopic model required for the time scale separation to hold.

In Section 2, we describe precisely the microscopic model and the TSS model,
and we state our main results. Our proof is based on a careful study of the behavior
of the population before the first mutation, and of the competition phase between
the mutant trait and the original trait, taking place just after the first mutation.
We will give an outline of the proof and of the methods in Section 3, as well as some
notations used throughout the paper. Section 4 gives comparison results and large
deviation results on birth and death processes (Sections 4.1 and 4.2), and several
results on branching processes (Section 4.3). Based on these properties, the proof
of the convergence of the microscopic model to the TSS model is given in Section 5.

2 Models and main results 

Let us first describe the microscopic model. In a population, Darwinian evolution
acts on a set of phenotypes, or traits, characterizing each individual’s ability to
survive and reproduce. We consider a finite number of quantitative traits in an
asexual population (clonal reproduction), and we assume that the trait space $\mathcal{X}$ is
a compact subset of $\mathbb{R}^l$ ($l \ge 1$).

The microscopic model involves the three basic mechanisms of Darwinian
evolution: heredity, which transmits traits to new offspring, mutation, driving a
variation in the trait values in the population, and selection between these different trait
values. The selection process, and thus a proper definition of the selective ability
of a trait, or fitness (cf. Metz et al., 1992), should (and will) be the consequence
of interactions between individuals in the population and of the competition for
limited resources or area, modeled as follows.

For any $x, y \in \mathcal{X}$, we introduce the following biological parameters:

$b(x) \in \mathbb{R}_+$ is the rate of birth from an individual holding trait $x$.

$d(x) \in \mathbb{R}_+$ is the rate of “natural” death for an individual holding trait $x$.

$\alpha(x, y) \in \mathbb{R}_+$ is the competition kernel representing the pressure felt by an individual
holding trait $x$ from an individual holding trait $y$.

$\mu(x) \in [0,1]$ is the probability that a mutation occurs in a birth from an individual
with trait $x$.

$m(x, dh)$ is the law of $h = y - x$, where the mutant trait $y$ is born from an individual
with trait $x$. It is a probability measure on $\mathbb{R}^l$, and since $y$ must belong to
the trait space $\mathcal{X}$, the support of $m(x, \cdot)$ is a subset of

$$\mathcal{X} - x = \{y - x : y \in \mathcal{X}\}.$$

$K \in \mathbb{N}$ is a parameter rescaling the competition kernel $\alpha(\cdot, \cdot)$. Biologically, $K$ can
be interpreted as scaling the resources or area available, and is related to the
biological concept of “carrying capacity”. It is also called “system size” by
Metz et al. (1996). As will appear later, this parameter is linked to the size of
the population: large $K$ means a large population (provided that the initial
condition is proportional to $K$).

$u_K \in [0,1]$ is a parameter depending on $K$ rescaling the probability of mutation
$\mu(\cdot)$. Small $u_K$ means rare mutations.

Let us also introduce the following notations, used throughout this paper:

$$\bar n_x = \frac{b(x) - d(x)}{\alpha(x, x)}, \tag{1}$$

$$\beta(x) = \mu(x)\, b(x)\, \bar n_x, \tag{2}$$

$$\text{and} \quad f(y, x) = b(y) - d(y) - \alpha(y, x)\, \bar n_x. \tag{3}$$

As will appear below, $\bar n_x$ can be interpreted as the equilibrium density of a
monomorphic population when there is no mutation, $\beta(x)$ as the mutation rate in this
population, and $f(y, x)$ as the fitness of a mutant individual with trait $y$ in this
population.
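As a purely illustrative sketch of the quantities (1), (2) and (3), the following Python functions evaluate them for user-supplied coefficients. The coefficients `b`, `d`, `alpha` and `mu` chosen here (a quadratic birth rate on the hypothetical trait space [-1.5, 1.5], constant death rate and constant competition kernel) are assumptions made for the example and do not come from the paper.

```python
def n_bar(b, d, alpha, x):
    """Equilibrium density (1): (b(x) - d(x)) / alpha(x, x)."""
    return (b(x) - d(x)) / alpha(x, x)


def beta(mu, b, d, alpha, x):
    """Mutation rate (2) of a monomorphic population with trait x."""
    return mu(x) * b(x) * n_bar(b, d, alpha, x)


def fitness(b, d, alpha, y, x):
    """Fitness (3) of a mutant trait y in a resident population with trait x."""
    return b(y) - d(y) - alpha(y, x) * n_bar(b, d, alpha, x)


# Hypothetical coefficients (illustration only, not taken from the paper).
b = lambda x: 4.0 - x * x      # birth rate, positive on [-1.5, 1.5]
d = lambda x: 1.0              # natural death rate
alpha = lambda x, y: 1.0       # competition kernel
mu = lambda x: 0.5             # mutation probability
```

With these assumed coefficients, the fitness reduces to $f(y, x) = x^2 - y^2$, so a mutant is advantaged exactly when it is closer to 0 than the resident, and $f(x, x) = 0$ for every $x$, as follows directly from (1) and (3).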

We consider, at any time $t \ge 0$, a finite number $N_t$ of individuals, each of them
holding a trait value in $\mathcal{X}$. Let us denote by $x_1, \ldots, x_{N_t}$ the trait values of these
individuals. The state of the population at time $t \ge 0$, rescaled by $K$, can be
described by the finite point measure on $\mathcal{X}$

$$\nu^K_t = \frac{1}{K} \sum_{i=1}^{N_t} \delta_{x_i}, \tag{4}$$

where $\delta_x$ is the Dirac measure at $x$. Let $\mathcal{M}_F$ denote the set of finite nonnegative
measures on $\mathcal{X}$, and define

$$\mathcal{M}^K = \left\{ \frac{1}{K} \sum_{i=1}^{n} \delta_{x_i} : n \ge 0,\ x_1, \ldots, x_n \in \mathcal{X} \right\}.$$

An individual holding trait $x$ in the population $\nu^K_t$ gives birth to another
individual with rate $b(x)$ and dies with rate

$$d(x) + \int_{\mathcal{X}} \alpha(x, y)\, \nu^K_t(dy) = d(x) + \frac{1}{K} \sum_{i=1}^{N_t} \alpha(x, x_i).$$

The parameter K scales the strength of competition, thus allowing the coexistence 
of more individuals in the population. 

A newborn holds the same trait value as its progenitor with probability $1 - u_K \mu(x)$,
and with probability $u_K \mu(x)$, the newborn is a mutant whose trait value
$y$ is chosen according to $y = x + h$, where $h$ is a random variable with law $m(x, dh)$.

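In other words, the process $(\nu^K_t, t \ge 0)$ is a $\mathcal{M}^K$-valued Markov process with
infinitesimal generator defined for any bounded measurable function $\phi$ from $\mathcal{M}^K$
to $\mathbb{R}$ by

$$
\begin{aligned}
L^K \phi(\nu) ={} & \int_{\mathcal{X}} \left( \phi\!\left(\nu + \frac{\delta_x}{K}\right) - \phi(\nu) \right) (1 - u_K \mu(x))\, b(x)\, K \nu(dx) \\
& + \int_{\mathcal{X}} \int_{\mathbb{R}^l} \left( \phi\!\left(\nu + \frac{\delta_{x+h}}{K}\right) - \phi(\nu) \right) u_K \mu(x)\, b(x)\, m(x, dh)\, K \nu(dx) \\
& + \int_{\mathcal{X}} \left( \phi\!\left(\nu - \frac{\delta_x}{K}\right) - \phi(\nu) \right) \left( d(x) + \int_{\mathcal{X}} \alpha(x, y)\, \nu(dy) \right) K \nu(dx).
\end{aligned}
\tag{5}
$$

When the measure $\nu$ has the form (4), the integrals with respect to $K\nu(dx)$ in (5)
correspond to sums over all individuals in the population. The first term (linear)
describes the births without mutation, the second term (linear) describes the births
with mutation, and the third term (non-linear) describes the deaths due to age
or competition. This logistic density-dependence models the competition in the
population, and hence drives the selection process.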
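The dynamics encoded by the generator (5) can be simulated individual by individual. The sketch below is one possible acceptance-rejection scheme, assuming upper bounds `b_bar`, `d_bar` and `alpha_bar` on the birth rate, natural death rate and competition kernel; it is not presented as the construction used in the paper. All function and parameter names, the example coefficients and the mutation sampler `sample_h` are hypothetical.

```python
import random


def simulate_population(traits, b, d, alpha, mu, sample_h, K, u_K,
                        b_bar, d_bar, alpha_bar, t_end, rng):
    """Return the list of traits alive at time t_end (rejection sampling).

    Candidate events are proposed at the maximal per-capita rate
    b_bar + d_bar + alpha_bar * n / K and accepted with the ratio of the
    true rate to its bound, which reproduces the rates of generator (5).
    """
    pop = list(traits)
    t = 0.0
    while pop:
        n = len(pop)
        per_capita = b_bar + d_bar + alpha_bar * n / K
        t += rng.expovariate(n * per_capita)
        if t > t_end:
            break
        i = rng.randrange(n)            # individual concerned by the event
        x = pop[i]
        r = rng.random() * per_capita
        if r < b_bar:                   # candidate birth
            if r < b(x):
                if rng.random() < u_K * mu(x):
                    pop.append(x + sample_h(x, rng))   # mutant offspring
                else:
                    pop.append(x)                      # clonal offspring
        elif r < b_bar + d_bar:         # candidate natural death
            if r - b_bar < d(x):
                pop[i] = pop[-1]
                pop.pop()
        else:                           # candidate death by competition
            j = rng.randrange(n)        # competitor, uniform in the population
            if rng.random() * alpha_bar < alpha(x, pop[j]):
                pop[i] = pop[-1]
                pop.pop()
    return pop


# Hypothetical coefficients on the trait space [-1.5, 1.5] (illustration only).
b = lambda x: 4.0 - x * x
d = lambda x: 1.0
alpha = lambda x, y: 1.0
mu = lambda x: 0.5
# sample_h must keep x + h inside the trait space.
sample_h = lambda x, rng: rng.uniform(max(-0.1, -1.5 - x), min(0.1, 1.5 - x))
```

For instance, calling `simulate_population([0.0] * 300, b, d, alpha, mu, sample_h, K=100, u_K=0.0, b_bar=4.0, d_bar=1.0, alpha_bar=1.0, t_end=3.0, rng=random.Random(0))` runs the mutation-free dynamics from a monomorphic state; the rescaled mass `len(pop) / K` is then expected to fluctuate around the value 3 given by (1) for these assumed coefficients.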

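Let us denote by (A) the following three assumptions:

(A1) $b$, $d$ and $\alpha$ are measurable functions, and there exist $\bar b, \bar d, \bar\alpha < +\infty$ such that

$$b(\cdot) \le \bar b, \quad d(\cdot) \le \bar d \quad \text{and} \quad \alpha(\cdot, \cdot) \le \bar\alpha.$$

(A2) $m(x, dh)$ is absolutely continuous with respect to the Lebesgue measure on
$\mathbb{R}^l$ with density $m(x, h)$, and there exists a function $\bar m : \mathbb{R}^l \to \mathbb{R}_+$ such that
$m(x, h) \le \bar m(h)$ for any $x \in \mathcal{X}$ and $h \in \mathbb{R}^l$, and $\int \bar m(h)\, dh < \infty$.

(A3) $\mu(x) > 0$ and $b(x) - d(x) > 0$ for any $x \in \mathcal{X}$, and there exists $\underline\alpha > 0$ such that

$$\underline\alpha \le \alpha(\cdot, \cdot).$$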

For fixed $K$, under (A1) and (A2) and assuming that $\mathbf{E}(\langle \nu^K_0, \mathbf{1} \rangle) < \infty$ (where
$\langle \nu, f \rangle$ denotes the integral of the measurable function $f$ with respect to the measure
$\nu$), the existence and uniqueness in law of a process with infinitesimal generator
$L^K$ has been proved by Fournier and Méléard (2003). When $K \to +\infty$, they
also proved, under more restrictive assumptions and assuming the convergence
of the initial condition, the convergence on $\mathbb{D}(\mathbb{R}_+, \mathcal{M}_F)$ of the process $\nu^K$ to a
deterministic process solution to a non-linear integro-differential equation. We will
only use particular cases of their result, stated in the next section, that can be
proved under assumptions (A1) and (A2).

The biological assumption of large population corresponds to the limit $K \to +\infty$,
and the assumption of rare mutations to $u_K \to 0$. As mentioned in the
introduction, the biological heuristics suggest another assumption: the impossibility
of coexistence of two different traits on a long time scale. As will appear in
Proposition 3 in the next section, this assumption can be stated mathematically as
follows:

(B) Given any $x \in \mathcal{X}$, Lebesgue almost any $y \in \mathcal{X}$ satisfies one of the two following
conditions:

$$\text{either} \quad (b(y) - d(y))\,\alpha(x, x) - (b(x) - d(x))\,\alpha(y, x) < 0, \tag{6}$$

$$\text{or} \quad
\begin{cases}
(b(y) - d(y))\,\alpha(x, x) - (b(x) - d(x))\,\alpha(y, x) > 0, \\
(b(x) - d(x))\,\alpha(y, y) - (b(y) - d(y))\,\alpha(x, y) < 0.
\end{cases}
\tag{7}$$

Before coming back to this assumption in the next section, let us only observe that
condition (6) is equivalent to $f(y, x) < 0$ and condition (7) to $f(y, x) > 0$ and
$f(x, y) < 0$.

The TSS model of evolution that we obtain from the microscopic model is a
Markov jump process in the trait space $\mathcal{X}$ with infinitesimal generator given, for
any bounded measurable function $\varphi$ from $\mathcal{X}$ to $\mathbb{R}$, by

$$A\varphi(x) = \int_{\mathbb{R}^l} (\varphi(x + h) - \varphi(x))\, \beta(x)\, \frac{[f(x + h, x)]_+}{b(x + h)}\, m(x, h)\, dh, \tag{8}$$

where $[a]_+$ denotes the positive part of $a \in \mathbb{R}$, and where $\beta(x)$ and $f(y, x)$ are
defined in (2) and (3). The existence and uniqueness in law of a process generated
by $A$ holds as soon as $\beta(x)[f(y, x)]_+ / b(y)$ is bounded (see e.g. Ethier and Kurtz,
1986), which is true under assumption (A) ($[f(y, x)]_+ / b(y) \le 1$). The biological
interpretation of the function $f$ as a fitness function becomes natural in view of this
generator: because of the positive part function $[\cdot]_+$ in (8), the TSS process can
only jump from a trait $x$ to the traits $x + h$ such that $f(x + h, x) > 0$. Therefore,
the function $f(y, x)$ measures the selective ability of trait $y$ in a population made
of individuals with trait $x$ (see Metz et al., 1992, 1996).
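The jump mechanism described by the generator (8) can be sketched by thinning: candidate mutants are proposed at rate $\beta(x)$ and each one is accepted with probability $[f(x+h, x)]_+ / b(x+h)$, which is at most 1 under (A). The function below is a minimal illustration under that reading; the coefficients, the trait space [-1.5, 1.5] and the sampler `sample_h` (standing in for the mutation law $m(x, dh)$) are hypothetical choices made for the example.

```python
import random


def tss_path(x0, b, d, alpha, mu, sample_h, t_end, rng):
    """Simulate a TSS trajectory on [0, t_end]; returns [(jump time, trait)]."""

    def n_bar(x):
        return (b(x) - d(x)) / alpha(x, x)

    t, x = 0.0, x0
    path = [(t, x)]
    while True:
        rate = mu(x) * b(x) * n_bar(x)          # beta(x), positive under (A3)
        t += rng.expovariate(rate)              # next candidate mutant
        if t > t_end:
            return path
        y = x + sample_h(x, rng)                # candidate mutant trait
        f = b(y) - d(y) - alpha(y, x) * n_bar(x)  # fitness f(y, x)
        if rng.random() < max(f, 0.0) / b(y):   # invasion and fixation
            x = y
            path.append((t, x))


# Hypothetical coefficients (illustration only, not taken from the paper).
b = lambda x: 4.0 - x * x
d = lambda x: 1.0
alpha = lambda x, y: 1.0
mu = lambda x: 0.5
sample_h = lambda x, rng: rng.uniform(max(-0.1, -1.5 - x), min(0.1, 1.5 - x))
```

With these assumed coefficients $f(y, x) = x^2 - y^2$, so a path started at `x0 = 1.0` can only move toward 0: every accepted jump decreases the absolute value of the trait.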

Our main result is: 

Theorem 1 Assume (A) and (B). Fix a sequence $(u_K)_{K \in \mathbb{N}}$ in $[0,1]^{\mathbb{N}}$ such that

$$\forall V > 0, \quad \exp(-VK) \ll u_K \ll \frac{1}{K \log K} \tag{9}$$

(where $f(K) \ll g(K)$ means that $f(K)/g(K) \to 0$ when $K \to \infty$). Fix also
$x \in \mathcal{X}$, $\gamma > 0$ and a sequence of $\mathbb{N}$-valued random variables $(\gamma_K)_{K \in \mathbb{N}}$, such that
$(\gamma_K / K)_{K \in \mathbb{N}}$ converges in law to $\gamma$ and is bounded in $L^1$. Consider the process
$(\nu^K_t, t \ge 0)$ generated by (5) with initial state $(\gamma_K / K)\delta_x$. Then, for any $n \ge 1$,
$\varepsilon > 0$ and $0 < t_1 < t_2 < \cdots < t_n < \infty$, and for any measurable subsets $\Gamma_1, \ldots, \Gamma_n$
of $\mathcal{X}$,

$$
\begin{aligned}
\lim_{K \to +\infty} \mathbf{P}\Big( \forall i \in \{1, \ldots, n\},\ \exists x_i \in \Gamma_i :\ & \mathrm{Supp}(\nu^K_{t_i / K u_K}) = \{x_i\} \\
& \text{and } |\langle \nu^K_{t_i / K u_K}, \mathbf{1} \rangle - \bar n_{x_i}| < \varepsilon \Big) = \mathbf{P}\big( \forall i \in \{1, \ldots, n\},\ X_{t_i} \in \Gamma_i \big)
\end{aligned}
\tag{10}
$$

where for any $\nu \in \mathcal{M}_F$, $\mathrm{Supp}(\nu)$ is the support of $\nu$ and $(X_t, t \ge 0)$ is the TSS
process generated by (8) with initial state $x$.

Remark 1 The time scale $1/K u_K$ of Theorem 1 is the time scale of the mutation
events for the process $\nu^K$ (the population size is proportional to $K$ and the individual
mutation rate is proportional to $u_K$). Assumption (9) is the condition leading to
the correct time scale separation between the mutation events and the birth and
death events. The limit (10) means that, when this time scale separation occurs, the
population is monomorphic at any time with high probability, and that the transition
periods corresponding to the invasion of a mutant trait in the resident population
and the ensuing competition are infinitesimal on this mutation time scale. Observe
also that this convergence result holds only for monomorphic initial conditions. We
will make some comments on more general initial conditions in the next section.
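To get a feeling for the two-sided scaling condition of Theorem 1, one can evaluate numerically the two ratios that it requires to vanish, namely $\exp(-VK)/u_K$ and $u_K K \log K$. The helper below is only an illustration; the candidate sequences $u_K = K^{-2}$ and $u_K = 1/K$ and the value $V = 0.1$ are arbitrary choices, and a finite table of values is of course no substitute for the limit.

```python
import math


def separation_ratios(u_of_K, K, V):
    """Return (exp(-V K) / u_K, u_K * K * log K) for a given K and V.

    Both quantities must tend to 0 as K grows for the scaling condition
    of Theorem 1 to hold (for every V > 0).
    """
    u_K = u_of_K(K)
    lower = math.exp(-V * K) / u_K
    upper = u_K * K * math.log(K)
    return lower, upper


good = lambda K: K ** -2.0   # both ratios vanish as K grows
bad = lambda K: 1.0 / K      # mutations too frequent: u_K K log K = log K

if __name__ == "__main__":
    for K in (10**2, 10**3, 10**4):
        print(K, separation_ratios(good, K, 0.1), separation_ratios(bad, K, 0.1))
```

For $u_K = 1/K$ the second ratio equals $\log K$ and diverges, so mutations would arrive before the competition phases (of duration of order $\log K$) are over.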

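Corollary 1 Assume additionally in Theorem 1 that $(\gamma_K / K)_{K \in \mathbb{N}}$ is bounded in
$L^p$ for some $p > 1$. Then the process $(\nu^K_{t / K u_K}, t \ge 0)$ converges when $K \to +\infty$,
in the sense of the finite dimensional distributions for the topology on $\mathcal{M}_F$ induced
by the functions $\nu \mapsto \langle \nu, f \rangle$ with $f$ bounded and measurable on $\mathcal{X}$, to the process
$(Y_t, t \ge 0)$ defined by

$$Y_t = \begin{cases} \gamma\, \delta_x & \text{if } t = 0, \\ \bar n_{X_t}\, \delta_{X_t} & \text{if } t > 0. \end{cases}$$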

This corollary follows from the following long time moment estimates. 

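Lemma 1 Assume (A) and that $\sup_{K \ge 1} \mathbf{E}(\langle \nu^K_0, \mathbf{1} \rangle^p) < +\infty$ for some $p \ge 1$. Then

$$\sup_{K \ge 1}\ \sup_{t \ge 0}\ \mathbf{E}(\langle \nu^K_t, \mathbf{1} \rangle^p) < +\infty,$$

and therefore, if $p > 1$, the family of random variables $\{\langle \nu^K_t, \mathbf{1} \rangle\}_{\{K \ge 1,\ t \ge 0\}}$ is
uniformly integrable.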

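Proof of Corollary 1 Let $\Gamma$ be a measurable subset of $\mathcal{X}$. Let us prove that

$$\lim_{K \to +\infty} \mathbf{E}(\langle \nu^K_{t / K u_K}, \mathbf{1}_\Gamma \rangle) = \mathbf{E}(\bar n_{X_t} \mathbf{1}_{\{X_t \in \Gamma\}}). \tag{11}$$

Fix $\varepsilon > 0$, and observe that $\bar n_x \in [0, \bar b / \underline\alpha]$. Write $[0, \bar b / \underline\alpha] \subset \bigcup_{i=1}^{q} I_i$, where $q$ is the
first integer greater than $\bar b / \varepsilon \underline\alpha$, and $I_i = [(i - 1)\varepsilon, i\varepsilon[$. Define $\Gamma_i = \{x \in \mathcal{X} : \bar n_x \in I_i\}$
for $1 \le i \le q$, and apply (10) to the sets $\Gamma \cap \Gamma_1, \ldots, \Gamma \cap \Gamma_q$ with $n = 1$, $t_1 = t$ and
the constant $\varepsilon$ above. Then, by Lemma 1, there exists a constant $C > 0$ such that

$$
\begin{aligned}
\limsup_{K \to +\infty} \mathbf{E}(\langle \nu^K_{t / K u_K}, \mathbf{1}_\Gamma \rangle)
& \le \limsup_{K \to +\infty} \mathbf{E}\big(\langle \nu^K_{t / K u_K}, \mathbf{1}_\Gamma \rangle \mathbf{1}_{\{\langle \nu^K_{t / K u_K}, \mathbf{1} \rangle \le C\}}\big) + \varepsilon \\
& \le \sum_{i=1}^{q} \limsup_{K \to +\infty} \mathbf{E}\big(\langle \nu^K_{t / K u_K}, \mathbf{1}_{\Gamma \cap \Gamma_i} \rangle \mathbf{1}_{\{\langle \nu^K_{t / K u_K}, \mathbf{1} \rangle \le C\}}\big) + \varepsilon \\
& \le \sum_{i=1}^{q} (i + 1)\varepsilon\, \mathbf{P}(X_t \in \Gamma \cap \Gamma_i) + \varepsilon \\
& \le \sum_{i=1}^{q} \big( \mathbf{E}(\bar n_{X_t} \mathbf{1}_{\{X_t \in \Gamma \cap \Gamma_i\}}) + 2\varepsilon\, \mathbf{P}(X_t \in \Gamma_i) \big) + \varepsilon \\
& \le \mathbf{E}(\bar n_{X_t} \mathbf{1}_{\{X_t \in \Gamma\}}) + 3\varepsilon.
\end{aligned}
$$

A similar estimate for the lim inf ends the proof of (11), which implies the
convergence of one-dimensional laws for the required topology.

The same method easily gives the required limit when we consider a finite
number of times $t_1, \ldots, t_n$. □

As suggested by the fact that the limit process $Y$ is not continuous at $0^+$,
it is not possible to obtain the convergence in law for the Skorohod topology on
$\mathbb{D}([0, T], \mathcal{M}_F)$. More generally, we can prove:

Proposition 1 For any $s < t$, the convergence of $\nu^K_{\cdot / K u_K}$ to $Y$ in Corollary 1 does
not hold for the Skorohod topology on $\mathbb{D}([s, t], \mathcal{M}_F)$, for any topology on $\mathcal{M}_F$ such
that the total mass function $\nu \mapsto \langle \nu, \mathbf{1} \rangle$ is continuous.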

Proof of Proposition 1 Assume the converse. Then, for some $s < t$, the total
mass $N^K_\cdot = \langle \nu^K_{\cdot / K u_K}, \mathbf{1} \rangle$ converges for the Skorohod topology on $\mathbb{D}([s, t], \mathbb{R}_+)$ to the
total mass of the process $Y$. In particular, by Ascoli’s theorem for càdlàg processes
(cf. Billingsley, 1968), for any $\varepsilon > 0$ and $\eta > 0$, there exists $\delta > 0$ such that

$$\limsup_{K \to +\infty} \mathbf{P}(\omega'(N^K, \delta) > \eta) < \varepsilon,$$

where the modulus of continuity $\omega'$ is defined by

$$\omega'(\varphi, \delta) := \inf \Big\{ \max_{i = 0, \ldots, r - 1} \omega(\varphi, [t_i, t_{i+1})) \Big\},$$

where the infimum is taken over all $r \in \mathbb{N}$ and all the finite partitions $s = t_0 < t_1 <
\cdots < t_r = t$ of $[s, t]$ such that $t_{i+1} - t_i > \delta$ for any $i \in \{0, \ldots, r - 1\}$, and where
$\omega(\varphi, I) := \sup_{x, y \in I} |\varphi(x) - \varphi(y)|$ for any interval $I$.

Now, for any function $\varphi \in \mathbb{D}([s, t], \mathbb{R})$, $\omega(\varphi, \delta) \le 2\omega'(\varphi, \delta) + \sup_{x \in [s, t]} |\varphi(x) - \varphi(x-)|$
(cf. Billingsley, 1968), where $\omega(\varphi, \delta) := \sup_{x, y \in [s, t],\ |x - y| < \delta} |\varphi(x) - \varphi(y)|$,
and for any $K \ge 1$, $\sup_{x \in [s, t]} |N^K_x - N^K_{x-}| = 1/K$. Therefore, for any $\varepsilon > 0$ and
$\eta > 0$, there exists $\delta > 0$ such that

$$\limsup_{K \to +\infty} \mathbf{P}(\omega(N^K, \delta) > \eta) < \varepsilon.$$

This implies that the sequence $(N^K)_K$ is actually C-tight (cf. Billingsley, 1968) and
that its limit is necessarily continuous, which is not true for $\langle Y_t, \mathbf{1} \rangle$. □

3 Notations and outline of the proof of Theorem 1

We start with some definitions needed to explain the idea of the proof of Theorem 1
and the precise meaning of assumption (B).

Definition 1

(a) For any $K \ge 1$, $b, d, \alpha \ge 0$ and for any $\mathbb{N}/K$-valued random variable $z$, we will
denote by $\mathbf{P}^K(b, d, \alpha, z)$ the law of the $\mathbb{N}/K$-valued Markov birth and death
process with initial state $z$ and with transition rates

    $ib$ from $i/K$ to $(i + 1)/K$,
    $i(d + \alpha i / K)$ from $i/K$ to $(i - 1)/K$.

(b) For any $K \ge 1$, $b_k, d_k, \alpha_{kl} \ge 0$ with $k, l \in \{1, 2\}$, and for any $\mathbb{N}/K$-valued
random variables $z_1$ and $z_2$, we will denote by

$$\mathbf{Q}^K(b_1, b_2, d_1, d_2, \alpha_{11}, \alpha_{12}, \alpha_{21}, \alpha_{22}, z_1, z_2)$$

the law of the $(\mathbb{N}/K)^2$-valued Markov birth and death process with initial state
$(z_1, z_2)$ and with transition rates

    $i b_1$ from $(i/K, j/K)$ to $((i + 1)/K, j/K)$,
    $j b_2$ from $(i/K, j/K)$ to $(i/K, (j + 1)/K)$,
    $i(d_1 + \alpha_{11} i/K + \alpha_{12} j/K)$ from $(i/K, j/K)$ to $((i - 1)/K, j/K)$,
    $j(d_2 + \alpha_{21} i/K + \alpha_{22} j/K)$ from $(i/K, j/K)$ to $(i/K, (j - 1)/K)$.

These two Markov processes have absorbing states at 0 and (0,0), respectively.
Observe also that, when $\alpha = 0$, the Markov process of point (a) is a continuous-time
binary branching process divided by $K$.
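The one-dimensional logistic birth and death process of Definition 1 (a) is straightforward to simulate with a standard Gillespie scheme. The sketch below returns the rescaled state at a given time; the function name and signature are hypothetical, and the scheme is a generic one rather than anything specific to the paper.

```python
import random


def simulate_PK(b, d, alpha, K, i0, t_end, rng):
    """State at time t_end of the N/K-valued birth and death process with
    rates i*b (up) and i*(d + alpha*i/K) (down), started from i0/K."""
    i, t = i0, 0.0
    while i > 0:                       # 0 is absorbing
        birth = i * b
        death = i * (d + alpha * i / K)
        total = birth + death
        if total == 0.0:               # no event can occur any more
            break
        t += rng.expovariate(total)    # waiting time before the next event
        if t > t_end:
            break
        if rng.random() * total < birth:
            i += 1
        else:
            i -= 1
    return i / K
```

Setting `alpha = 0` gives the binary branching process divided by `K` mentioned above, and taking `b`, `d`, `alpha` equal to $b(x)$, $d(x)$, $\alpha(x, x)$ gives the total mass of a monomorphic population without mutation.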

Fix $x$ and $y$ in $\mathcal{X}$. The proof of the following two results can be found in
Chap. 11 of Ethier and Kurtz (1986).

Proposition 2

(a) Assume $\mu \equiv 0$ and $\nu^K_0 = N^K_x(0)\delta_x$. Then, for any $t \ge 0$, $\nu^K_t = N^K_x(t)\delta_x$,
where $N^K_x$ has the law $\mathbf{P}^K(b(x), d(x), \alpha(x, x), N^K_x(0))$. Assume $N^K_x(0) \to
n_x(0)$ in probability when $K \to +\infty$. Then, the sequence $(N^K_x)$ converges in
probability on $[0, T]$ for the uniform norm to the deterministic function $n_x$
with initial condition $n_x(0)$ solution to

$$\dot n_x = (b(x) - d(x) - \alpha(x, x) n_x)\, n_x. \tag{12}$$

(b) Assume $\mu \equiv 0$ and $\nu^K_0 = N^K_x(0)\delta_x + N^K_y(0)\delta_y$. Then, for any $t \ge 0$, $\nu^K_t =
N^K_x(t)\delta_x + N^K_y(t)\delta_y$, where $(N^K_x, N^K_y)$ has the law

$$\mathbf{Q}^K(b(x), b(y), d(x), d(y), \alpha(x, x), \alpha(x, y), \alpha(y, x), \alpha(y, y), N^K_x(0), N^K_y(0)).$$

Assume $N^K_x(0) \to n_x(0)$ and $N^K_y(0) \to n_y(0)$ in probability when $K \to +\infty$.
Then, $(N^K_x, N^K_y)$ converges in probability when $K \to +\infty$ on $[0, T]$ for the
uniform norm to the deterministic function $(n_x, n_y)$ with initial condition
$(n_x(0), n_y(0))$ solution to

$$
\begin{aligned}
\dot n_x &= (b(x) - d(x) - \alpha(x, x) n_x - \alpha(x, y) n_y)\, n_x \\
\dot n_y &= (b(y) - d(y) - \alpha(y, x) n_x - \alpha(y, y) n_y)\, n_y.
\end{aligned}
\tag{13}
$$

Note that, under assumption (A3), the logistic equation (12) has two steady
states, 0, unstable, and $\bar n_x$, defined in (1), stable. The system (13) has at least
three steady states, (0,0), unstable, $(\bar n_x, 0)$ and $(0, \bar n_y)$.

The assumption (B) of Section 2 is the mathematical formulation of the
impossibility of coexistence of two different traits, in the sense that, starting in the
neighborhood of the equilibrium $(\bar n_x, 0)$ of system (13), either its solution converges to
this equilibrium or to the equilibrium $(0, \bar n_y)$. More precisely, the following
proposition follows from an elementary analysis of system (13) (cf. e.g. Istas, 2000, pp.
25-27):

Proposition 3 If $x$ and $y$ satisfy (6), then $(\bar n_x, 0)$ is a stable steady state of (13).
If $x$ and $y$ satisfy (7), then $(\bar n_x, 0)$ is an unstable steady state, $(0, \bar n_y)$ is stable, and
any solution to (13) with initial state in $(\mathbb{R}^*_+)^2$ converges to $(0, \bar n_y)$ when $t \to +\infty$.
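The behaviour stated in Proposition 3 can be observed numerically on the two-dimensional competitive system for the densities of the resident trait $x$ and the mutant trait $y$. The sketch below integrates that system with a classical fourth-order Runge-Kutta scheme; the numerical values (birth rates 3 and 3.75, death rates 1, all competition coefficients 1) are hypothetical and were chosen so that the mutant has positive invasion fitness while the resident has negative fitness against the mutant.

```python
def competition_rhs(n, bx, by, dx, dy, axx, axy, ayx, ayy):
    """Right-hand side of the two-trait logistic system for (n_x, n_y)."""
    nx, ny = n
    return ((bx - dx - axx * nx - axy * ny) * nx,
            (by - dy - ayx * nx - ayy * ny) * ny)


def rk4(rhs, n0, t_end, dt):
    """Integrate dn/dt = rhs(n) on [0, t_end] with the classical RK4 scheme."""
    n = tuple(n0)
    for _ in range(int(round(t_end / dt))):
        k1 = rhs(n)
        k2 = rhs(tuple(a + 0.5 * dt * k for a, k in zip(n, k1)))
        k3 = rhs(tuple(a + 0.5 * dt * k for a, k in zip(n, k2)))
        k4 = rhs(tuple(a + dt * k for a, k in zip(n, k3)))
        n = tuple(a + dt * (p + 2.0 * q + 2.0 * r + s) / 6.0
                  for a, p, q, r, s in zip(n, k1, k2, k3, k4))
    return n


# Hypothetical parameters: resident equilibrium 2.0, mutant equilibrium 2.75.
rhs = lambda n: competition_rhs(n, 3.0, 3.75, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0)

if __name__ == "__main__":
    # Start near the resident equilibrium with a small mutant density.
    print(rk4(rhs, (2.0, 0.01), 40.0, 0.01))
```

Starting from the resident equilibrium with a small mutant density, the computed solution leaves the neighborhood of $(2, 0)$ and approaches $(0, 2.75)$, the mutant equilibrium for these assumed parameters.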

Let us now give the main ideas of the proof of Theorem 1. It is based on two
main ingredients: first, when $\mu \equiv 0$ and $\nu^K_0$ is monomorphic with trait $x$, we have
seen in Proposition 2 (a) the convergence of $\nu^K$ to $n(t)\delta_x$, where $n(t)$ is solution
to (12). Any solution to this equation with positive initial condition converges for
large time to $\bar n_x$. The large deviations estimates for this convergence will allow us
to show that the time during which the stochastic process stays in a neighborhood
of its limit (problem of exit from domain, Freidlin and Wentzell, 1984) is of the
order of $\exp(KV)$ with $V > 0$. Now, when $u_K$ is small, the process $\nu^K$ with a
monomorphic initial condition with trait $x$ is close to the same process with $\mu \equiv 0$,
as long as no mutation occurs. Therefore, the left inequality in (9) will allow us to
prove that, with high probability, the first mutation event (occurring on the time
scale $1/K u_K$) occurs before the total density drifts away from $\bar n_x$.

The second ingredient of our proof is the study of the invasion of a mutant trait
$y$ that has just appeared in a monomorphic population with trait $x$. This invasion
can be divided into three steps (Fig. 1), in a similar way as is done classically by
population geneticists dealing with selective sweeps (Kaplan et al., 1989; Durrett
and Schweinsberg, 2004):


Figure 1: The three steps of the invasion of a mutant trait $y$ in a monomorphic
population with trait $x$ (plot of population size against time; image not reproduced).


• Firstly, as long as the mutant population size $\langle \nu^K_t, \mathbf{1}_{\{y\}} \rangle$ (initially equal to
$1/K$) is smaller than a fixed small $\varepsilon > 0$ (before $t_1$ in Fig. 1), the resident
dynamics is very close to what it was before the mutation, so $\langle \nu^K_t, \mathbf{1}_{\{x\}} \rangle$
stays close to $\bar n_x$. Then, the death rate of a mutant individual is close to
the constant $d(y) + \alpha(y, x)\bar n_x$. Since its birth rate is constant, equal to $b(y)$,
we can approximate the mutant dynamics by a binary branching process.
Therefore, the probability that $\langle \nu^K_t, \mathbf{1}_{\{y\}} \rangle$ reaches $\varepsilon$ is approximately equal
to the probability that this branching process reaches $\varepsilon K$, which converges
when $K \to +\infty$ to its probability of non-extinction $[f(y, x)]_+ / b(y)$.

• Secondly, once $\langle \nu^K_t, \mathbf{1}_{\{y\}} \rangle$ has reached $\varepsilon$, by Proposition 2 (b), for large $K$,
$\nu^K$ is close to the solution to (13) with initial state $(\bar n_x, \varepsilon)$ (represented with
dotted lines in Fig. 1) with high probability. By Proposition 3, this solution
will be shown to reach the $\varepsilon$-neighborhood of $(0, \bar n_y)$ in finite time ($t_2$ in
Fig. 1).

• Finally, once $\langle \nu^K_t, \mathbf{1}_{\{y\}} \rangle$ is close to $\bar n_y$ and $\langle \nu^K_t, \mathbf{1}_{\{x\}} \rangle$ is small, $K \langle \nu^K_t, \mathbf{1}_{\{x\}} \rangle$ can
be approximated, in a similar way as in the first step, by a binary branching
process, which is subcritical and hence gets extinct a.s. in finite time ($t_3$ in
Fig. 1).
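The branching approximation used in the first and third steps can be made concrete with a classical gambler's ruin computation. For a binary branching process with individual birth rate $b$ and death rate $\delta$, the embedded jump chain is a simple random walk that moves up with probability $b/(b + \delta)$, so the probability of reaching $m$ individuals before extinction, starting from one individual, is $(1 - \delta/b)/(1 - (\delta/b)^m)$ when $\delta \ne b$. The sketch below evaluates this formula and its limit; the function names and the numerical values in the usage note are illustrative assumptions.

```python
def reach_probability(b, death, m):
    """Probability that a binary branching process (birth rate b, death rate
    `death`, one initial individual) reaches m individuals before dying out."""
    if b == death:                 # critical case: symmetric random walk
        return 1.0 / m
    r = death / b
    if r < 1.0:                    # supercritical case
        return (1.0 - r) / (1.0 - r ** m)
    q = (1.0 / r) ** m             # subcritical case, written to avoid overflow
    return (r - 1.0) * q / (1.0 - q)


def survival_probability(b, death):
    """Limit of reach_probability as m grows: [b - death]_+ / b."""
    return max(b - death, 0.0) / b
```

With $b = b(y)$ and $\delta = d(y) + \alpha(y, x)\bar n_x$, one has $b - \delta = f(y, x)$, so `survival_probability` returns $[f(y, x)]_+ / b(y)$. For example, with the hypothetical values `b = 3.75` and `death = 3.0`, `reach_probability(3.75, 3.0, m)` approaches 0.2 as `m` grows, whereas exchanging the two rates (a subcritical process, as in the third step) gives a probability that tends to 0.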

We will see in Sections 4.2 and 4.3 that the time needed to complete the first
and third steps is proportional to $\log K$, whereas the time needed for the second
step is bounded. Therefore, since the time between two mutations is of the order of
$1/K u_K$, the right inequality in (9) will allow us to prove that, with high probability,
the three steps above are completed before a new mutation occurs.

Remark 2 As observed by Metz et al. (1996), the biological heuristics leading to the
TSS model extend to the case of polymorphic initial condition, where the population
is composed of a finite number of distinct traits (see also Champagnat, 200)). Our
mathematical method can also be extended easily to n-morphic initial conditions,
except for one difficulty: one has to replace assumption (B) by another assumption
stating that, for any $n$, any solution to the n-morphic logistic systems generalizing
(13) converges to an equilibrium (as in Proposition 3), and that the equilibria
of these systems are non-degenerate, in the sense that the branching processes in
the first and third steps above are not critical, or, equivalently, that a first-order
linear analysis of these equilibria suffices to determine their stability. Then, one
could construct a polymorphic TSS model in which the number of coexisting traits
is not fixed. However, the asymptotic analysis of n-dimensional logistic systems is
non-trivial and may exhibit cycles or chaos, except when $n = 1$ or 2, and analytical
assumptions ensuring the condition above are difficult to find.

Section 4 will provide the large deviations and branching process results needed
to make formal the previous heuristics. We will also prove several comparison
results between $\langle \nu^K, \mathbf{1} \rangle$ and the birth and death processes of Definition 1. In
Section 5, the proof of Theorem 1 is achieved by computing, for any $t$, the limit law
of $\nu^K_{t / K u_K}$ according to the random number of mutations having occurred between
0 and $t / K u_K$.

Notations 

• $\lceil a \rceil$ denotes the first integer greater or equal to $a$, and $\lfloor a \rfloor$ denotes the integer
part of $a$.

• For any $K \ge 1$ and $\nu \in \mathcal{M}^K$, we will denote by $\mathbf{P}^K_\nu$ the law of the process $\nu^K$
generated by (5) with initial state $\nu$, and by $\mathbf{E}^K_\nu$ the expectation with respect
to $\mathbf{P}^K_\nu$.

• The convergence in probability of finite dimensional random variables will be
denoted by $\xrightarrow{\mathcal{P}}$.

• We will denote by $\mathcal{L}(Z)$ the law of the stochastic process $(Z_t, t \ge 0)$.

• We will denote by $\preceq$ the following stochastic domination relation: if $Q_1$ and
$Q_2$ are the laws of $\mathbb{R}$-valued processes, we will write $Q_1 \preceq Q_2$ if we can
construct on the same probability space $(\Omega, \mathcal{F}, \mathbf{P})$ two processes $X^1$ and $X^2$
such that $\mathcal{L}(X^i) = Q_i$ ($i = 1, 2$) and $\forall t \ge 0$, $\forall \omega \in \Omega$, $X^1_t(\omega) \le X^2_t(\omega)$.

• Finally, if $X^1$ and $X^2$ are two random processes and $T$ is a random time
constructed on the same probability space as $X^1$, we will write $X^1_t \preceq X^2_t$
for $t \le T$ (resp. $X^2_t \preceq X^1_t$ for $t \le T$) if we can construct a process $\hat X^2$ on
the same probability space as $X^1$, such that $\mathcal{L}(\hat X^2) = \mathcal{L}(X^2)$ and $\forall t \le T$,
$\forall \omega \in \Omega$, $X^1_t(\omega) \le \hat X^2_t(\omega)$ (resp. $\hat X^2_t(\omega) \le X^1_t(\omega)$).

4 Birth and death processes 

We will collect in this section various results on the birth and death processes that
appeared in Definition 1.

4.1 Comparison results 

The following theorem gives various stochastic domination results.

Theorem 2

(a) Assume (A). For any $K \ge 1$ and any $L^1$ initial condition $\nu^K_0$ of the process $\nu^K$,

$$\mathcal{L}(\langle \nu^K, \mathbf{1} \rangle) \preceq \mathbf{P}^K(2\bar b, 0, \underline\alpha, \langle \nu^K_0, \mathbf{1} \rangle).$$

(b) With the same assumptions as in (a), let $A^K_t$ denote the number of mutations
occurring in $\nu^K$ between times 0 and $t$, and let $a, a_1, a_2 > 0$. Then, for
$t \le \inf\{s \ge 0 : \langle \nu^K_s, \mathbf{1} \rangle \ge a\}$,

$$A^K_t \preceq B^K_t,$$

where $B^K$ is a Poisson process with parameter $K u_K a \bar b$.

If moreover $\nu^K_0 = \langle \nu^K_0, \mathbf{1} \rangle \delta_x$, define $\tau_1 = \inf\{t \ge 0 : A^K_t = 1\}$ (the first
mutation time). Then, for $t \le \tau_1 \wedge \inf\{s \ge 0 : \langle \nu^K_s, \mathbf{1} \rangle \notin [a_1, a_2]\}$,

$$B^K_t \preceq A^K_t \preceq C^K_t, \tag{14}$$

where $B^K$ and $C^K$ are Poisson processes with respective parameters $K u_K a_1 \mu(x) b(x)$
and $K u_K a_2 \mu(x) b(x)$.

(c) Fix $K \ge 1$ and take $b, d, \alpha, z$ as in Definition 1 (a). Then, for any $\varepsilon_1, \varepsilon_2, \varepsilon_3 \ge 0$
and any $\mathbb{N}/K$-valued random variable $\varepsilon_4$,

$$\mathbf{P}^K(b, d + \varepsilon_2, \alpha + \varepsilon_3, z) \preceq \mathbf{P}^K(b + \varepsilon_1, d, \alpha, z + \varepsilon_4).$$

(d) Let $(Z^1, Z^2)$ be a stochastic process with law

$$\mathbf{Q}^K(b_1, b_2, d_1, d_2, \alpha_{11}, \alpha_{12}, \alpha_{21}, \alpha_{22}, z_1, z_2)$$

where the parameters are as in Definition 1 (b). Fix $a > 0$ and define $T =
\inf\{t \ge 0,\ Z^2_t \ge a\}$. Then, for $t \le T$,

$$M^1_t \preceq Z^1_t \preceq M^2_t,$$

where $\mathcal{L}(M^1) = \mathbf{P}^K(b_1, d_1 + a\alpha_{12}, \alpha_{11}, z_1)$
and $\mathcal{L}(M^2) = \mathbf{P}^K(b_1, d_1, \alpha_{11}, z_1)$.

(e) Take $(Z^1, Z^2)$ as above, fix $0 < a_1 < a_2$ and $a > 0$, and define $T = \inf\{t \ge
0,\ Z^1_t \notin [a_1, a_2] \text{ or } Z^2_t \ge a\}$. Then, for $t \le T$,

$$M^1_t \preceq Z^2_t \preceq M^2_t,$$

where $\mathcal{L}(M^1) = \mathbf{P}^K(b_2, d_2 + a_2\alpha_{21} + a\alpha_{22}, 0, z_2)$
and $\mathcal{L}(M^2) = \mathbf{P}^K(b_2, d_2 + a_1\alpha_{21}, 0, z_2)$.

Remark 3 Point (a) explains why it is necessary to combine simultaneously the
limits $K \to +\infty$ and $u_K \to 0$ in order to obtain the TSS process in Theorem 1.
The limit $K \to +\infty$ taken alone leads to a deterministic dynamics (Fournier and
Méléard, 2003), so making the rare mutations limit afterwards cannot lead to a
stochastic process. Conversely, taking the limit of rare mutations without making
the population larger would lead to an immediate extinction of the population in
the mutations time scale, because the stochastic domination of Theorem 2 (a) is
independent of $u_K$ and $\mu(\cdot)$, and because a process $Z$ with law $\mathbf{P}^K(2\bar b, 0, \underline\alpha, \gamma_K / K)$
gets a.s. extinct in finite time.

Before proving Theorem |2 let us deduce from Point (a) the LemmaHstated in 
Section 0 
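Proof of Lemma 1 By Theorem 2 (a), it suffices to prove that

$$\sup_{K \ge 1}\, \sup_{t \ge 0}\, \mathbf{E}\big((Z^K_t)^p\big) < +\infty,$$

where $\mathcal{L}(Z^K) = \mathbf{P}^K(2\bar b, 0, \underline{\alpha}, z_0^K)$, when $\sup_{K \ge 1} \mathbf{E}((z_0^K)^p) < +\infty$.
Let us define $p^k_t = \mathbf{P}(Z^K_t = k/K)$. Then

$$\frac{d}{dt}\mathbf{E}\big((Z^K_t)^p\big) = \frac{1}{K^p}\sum_{k \ge 1} k^p\, \frac{dp^k_t}{dt}$$

$$= \frac{1}{K^p}\sum_{k \ge 1} k^p \left[ 2\bar b (k-1)\, p^{k-1}_t + \underline{\alpha}\, \frac{(k+1)^2}{K}\, p^{k+1}_t - k\left(2\bar b + \underline{\alpha}\frac{k}{K}\right) p^k_t \right]$$

$$= \frac{1}{K^p}\sum_{k \ge 1} \left[ 2\bar b \left( \Big(1 + \frac{1}{k}\Big)^p - 1 \right) + \underline{\alpha}\, \frac{k}{K} \left( \Big(1 - \frac{1}{k}\Big)^p - 1 \right) \right] k^{p+1}\, p^k_t.$$

Now, for $k/K > 4\bar b/\underline{\alpha}$, the quantity inside the square brackets in the last expression can be upper bounded by $-2\bar b\,[3 - 2(1 - 1/k)^p - (1 + 1/k)^p]$, which is equivalent to $-2\bar b p/k$ when $k \to +\infty$. Therefore, there exists a constant $k_0$, that can be assumed bigger than $4\bar b/\underline{\alpha}$, such that, for any $k \ge k_0$, $-2\bar b\,[3 - 2(1 - 1/k)^p - (1 + 1/k)^p] \le -\bar b p/k$. Then, using the fact that $(1 + x)^p - 1 \le x(2^p - 1)$ for any $x \in [0,1]$, we can write

$$\frac{d}{dt}\mathbf{E}\big((Z^K_t)^p\big) \le \sum_{k=1}^{K k_0} 2\bar b (2^p - 1) \Big(\frac{k}{K}\Big)^p p^k_t - \sum_{k > K k_0} \bar b p \Big(\frac{k}{K}\Big)^p p^k_t$$

$$\le 2\bar b (2^p - 1) k_0^p + \bar b p\, k_0^p - \bar b p\, \mathbf{E}\big((Z^K_t)^p\big).$$

Writing $C = (2(2^p - 1) + p)\, k_0^p / p$, this differential inequality solves as

$$\mathbf{E}\big((Z^K_t)^p\big) \le C + \big[\mathbf{E}\big((z_0^K)^p\big) - C\big] e^{-\bar b p t},$$

which gives the required uniform bound. □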
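
The two elementary estimates used in this proof can be checked numerically. The sketch below is only an illustration: the values of `p` and `b_bar` are hypothetical, and the function names are ours. It evaluates the upper bound of the bracket, whose product with $k$ should approach $-2\bar b p$, and the gap in the convexity inequality $(1+x)^p - 1 \le x(2^p - 1)$.

```python
def bracket_upper_bound(k, p, b_bar):
    """Value of -2*b_bar*[3 - 2(1 - 1/k)^p - (1 + 1/k)^p]."""
    return -2.0 * b_bar * (3.0 - 2.0 * (1.0 - 1.0 / k) ** p - (1.0 + 1.0 / k) ** p)


def convexity_gap(x, p):
    """x(2^p - 1) - ((1 + x)^p - 1); nonnegative on [0, 1] when p >= 1."""
    return x * (2.0 ** p - 1.0) - ((1.0 + x) ** p - 1.0)


# Hypothetical values, chosen only for illustration.
p, b_bar = 3.0, 1.0
for k in (10, 100, 10_000):
    # k * bound approaches -2 * b_bar * p as k grows.
    print(k, k * bracket_upper_bound(k, p, b_bar))
```

For large $k$ the bound is also below $-\bar b p/k$, which is the property that defines $k_0$ in the proof.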


Proof of Theorem 2 The proof is essentially intuitive if one computes upper and lower bounds of the birth and death rates for each process considered in the statement of the theorem. We will simply give the explicit construction of the process $\nu^K$, and the proof of (b) as an example. We leave the remaining comparison results to the reader.

We will use the construction of the process $\nu^K$ given by Fournier and Méléard (2003): let $(\Omega, \mathcal{F}, \mathbf{P})$ be a sufficiently large probability space, and consider on this space the following five independent random objects:

(i) an $\mathcal{M}^K$-valued random variable $\nu_0^K$ (the initial distribution),

(ii) a Poisson point measure $N_1(ds, di, dv)$ on $[0,\infty[ \times \mathbb{N} \times [0,1]$ with intensity measure $q_1(ds, di, dv) = \bar b\, ds \sum_{k \ge 1} \delta_k(di)\, dv$ (the birth without mutation Poisson point measure),

(iii) a Poisson point measure $N_2(ds, di, dh, dv)$ on $[0,\infty[ \times \mathbb{N} \times \mathbb{R}^l \times [0,1]$ with intensity measure $q_2(ds, di, dh, dv) = \bar b\, ds \sum_{k \ge 1} \delta_k(di)\, \bar m(h)\, dh\, dv$ (the birth with mutation Poisson point measure),

(iv) a Poisson point measure $N_3(ds, di, dv)$ on $[0,\infty[ \times \mathbb{N} \times [0,1]$ with intensity measure $q_3(ds, di, dv) = \bar d\, ds \sum_{k \ge 1} \delta_k(di)\, dv$ (the natural death Poisson point measure),

(v) a Poisson point measure $N_4(ds, di, dj, dv)$ on $[0,\infty[ \times \mathbb{N} \times \mathbb{N} \times [0,1]$ with intensity measure $q_4(ds, di, dj, dv) = (\bar\alpha/K)\, ds \sum_{k \ge 1} \delta_k(di) \sum_{m \ge 1} \delta_m(dj)\, dv$ (the competition death Poisson point measure).


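We will also need the following function, solving the purely notational problem of associating a number to each individual in the population: for any $K \ge 1$, let $H = (H^1, \ldots, H^k, \ldots)$ be the map from $\mathcal{M}^K$ into $(\mathbb{R}^l)^{\mathbb{N}}$ defined by

$$H\Big(\frac{1}{K}\sum_{i=1}^{n} \delta_{x_i}\Big) = (x_{\sigma(1)}, \ldots, x_{\sigma(n)}, 0, \ldots, 0, \ldots),$$

where $x_{\sigma(1)} \preceq \ldots \preceq x_{\sigma(n)}$ for the lexicographic order $\preceq$ on $\mathbb{R}^l$. For convenience, we have omitted in our notation the dependence of $H$ and $H^i$ on $K$.

Then a process $\nu^K$ with generator $L^K$ and initial state $\nu_0^K$ can be constructed as follows: for any $t \ge 0$,

$$\nu^K_t = \nu^K_0 + \int_0^t \int_{\mathbb{N}} \int_0^1 \mathbf{1}_{\{i \le K\langle \nu^K_{s-}, \mathbf{1}\rangle\}}\, \frac{\delta_{H^i(\nu^K_{s-})}}{K}\, \mathbf{1}_{\left\{ v \le \frac{[1 - u_K \mu(H^i(\nu^K_{s-}))]\, b(H^i(\nu^K_{s-}))}{\bar b} \right\}}\, N_1(ds, di, dv)$$

$$+ \int_0^t \int_{\mathbb{N}} \int_{\mathbb{R}^l} \int_0^1 \mathbf{1}_{\{i \le K\langle \nu^K_{s-}, \mathbf{1}\rangle\}}\, \frac{\delta_{H^i(\nu^K_{s-}) + h}}{K}\, \mathbf{1}_{\left\{ v \le \frac{u_K \mu(H^i(\nu^K_{s-}))\, b(H^i(\nu^K_{s-}))\, m(H^i(\nu^K_{s-}), h)}{\bar b\, \bar m(h)} \right\}}\, N_2(ds, di, dh, dv)$$

$$- \int_0^t \int_{\mathbb{N}} \int_0^1 \mathbf{1}_{\{i \le K\langle \nu^K_{s-}, \mathbf{1}\rangle\}}\, \frac{\delta_{H^i(\nu^K_{s-})}}{K}\, \mathbf{1}_{\left\{ v \le \frac{d(H^i(\nu^K_{s-}))}{\bar d} \right\}}\, N_3(ds, di, dv)$$

$$- \int_0^t \int_{\mathbb{N}} \int_{\mathbb{N}} \int_0^1 \mathbf{1}_{\{i \le K\langle \nu^K_{s-}, \mathbf{1}\rangle\}}\, \mathbf{1}_{\{j \le K\langle \nu^K_{s-}, \mathbf{1}\rangle\}}\, \frac{\delta_{H^i(\nu^K_{s-})}}{K}\, \mathbf{1}_{\left\{ v \le \frac{\alpha(H^i(\nu^K_{s-}), H^j(\nu^K_{s-}))}{\bar\alpha} \right\}}\, N_4(ds, di, dj, dv). \tag{15}$$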


Although this formula is quite complicated, the principle is simple: for each type of event, the corresponding Poisson point process jumps faster than $\nu^K$ has to. We decide whether a jump of the process $\nu^K$ occurs by comparing $v$ to a quantity related to the rates of the various events. The indicator functions involving $i$ and $j$ ensure that the $i$-th and $j$-th individuals are alive in the population (because $K\langle \nu^K_t, \mathbf{1}\rangle$ is the number of individuals in the population at time $t$).
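
The acceptance-rejection principle just described can be sketched for the simplest case of a monomorphic population without mutation ($u_K = 0$). This is a minimal illustration and not the construction of the paper: the constant rates `b`, `d`, `alpha`, their upper bounds `b_bar`, `d_bar`, `alpha_bar` and the function name are hypothetical. Restricting the indices $i$ and $j$ to living individuals amounts to proposing candidate events at rates $\bar b n$, $\bar d n$ and $(\bar\alpha/K) n^2$ when $n$ individuals are alive.

```python
import random


def simulate_by_thinning(n0, K, b, d, alpha, b_bar, d_bar, alpha_bar, t_max, seed=0):
    """Logistic birth-death path (times, densities n/K) built by thinning
    dominating Poisson clocks. Assumes b <= b_bar, d <= d_bar, alpha <= alpha_bar."""
    rng = random.Random(seed)
    t, n = 0.0, n0
    path = [(t, n / K)]
    while n > 0:
        # Candidate-event rates, restricted to the indices of living individuals.
        r_birth = b_bar * n
        r_death = d_bar * n
        r_comp = (alpha_bar / K) * n * n
        total = r_birth + r_death + r_comp
        t += rng.expovariate(total)
        if t > t_max:
            break
        u = rng.random() * total   # which dominating measure fired
        v = rng.random()           # the uniform mark compared with a rate ratio
        if u < r_birth:
            if v < b / b_bar:
                n += 1
        elif u < r_birth + r_death:
            if v < d / d_bar:
                n -= 1
        elif v < alpha / alpha_bar:
            n -= 1
        path.append((t, n / K))
    return path
```

Rejected candidates leave the state unchanged, so the accepted events occur at the true rates $b n$, $d n$ and $\alpha n^2 / K$.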

Under (A1), (A2) and the assumption that $\mathbf{E}(\langle \nu_0^K, \mathbf{1}\rangle) < \infty$, Fournier and Méléard (2003) prove the existence and uniqueness of the solution to (15), and that this solution is a Markov process with infinitesimal generator $L^K$.

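Now, let us come to the proof of (b). The process $A^K$ can be written as

$$A^K_t := \int_0^t \int_{\mathbb{N}} \int_{\mathbb{R}^l} \int_0^1 \mathbf{1}_{\{i \le K\langle \nu^K_{s-}, \mathbf{1}\rangle\}}\, \mathbf{1}_{\left\{ v \le \frac{u_K \mu(H^i(\nu^K_{s-}))\, b(H^i(\nu^K_{s-}))\, m(H^i(\nu^K_{s-}), h)}{\bar b\, \bar m(h)} \right\}}\, N_2(ds, di, dh, dv).$$

In the case where $\nu_0^K = \langle \nu_0^K, \mathbf{1}\rangle \delta_x$, as long as $t < \tau_1$, $\nu^K_t = \langle \nu^K_t, \mathbf{1}\rangle \delta_x$. Therefore, for $t < \tau_1 \wedge \inf\{s \ge 0 : \langle \nu^K_s, \mathbf{1}\rangle \notin [a_1, a_2]\}$,

$$\int_0^t \int_{\mathbb{N}} \int_{\mathbb{R}^l} \int_0^1 \mathbf{1}_{\{i \le K a_1\}}\, \mathbf{1}_{\left\{ v \le \frac{u_K \mu(x) b(x) m(x,h)}{\bar b\, \bar m(h)} \right\}}\, N_2(ds, di, dh, dv) \le A^K_t$$

$$\le \int_0^t \int_{\mathbb{N}} \int_{\mathbb{R}^l} \int_0^1 \mathbf{1}_{\{i \le K a_2\}}\, \mathbf{1}_{\left\{ v \le \frac{u_K \mu(x) b(x) m(x,h)}{\bar b\, \bar m(h)} \right\}}\, N_2(ds, di, dh, dv). \tag{16}$$

Since the intensity measure of $N_2$ is

$$q_2(ds, di, dh, dv) = \bar b\, ds \sum_{k \ge 1} \delta_k(di)\, \bar m(h)\, dh\, dv,$$

the left-hand side and the right-hand side of (16) are Poisson processes with parameters $K u_K a_1 \mu(x) b(x)$ and $K u_K a_2 \mu(x) b(x)$, respectively. □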


4.2 Problem of exit from a domain 

Let us give some results on $\mathbf{P}^K(b, d, \alpha, z)$ when $\alpha > 0$. Points (a) and (b) of the following theorem strengthen Proposition 2, and point (c) studies the problem of exit from a domain.

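Theorem 3

(a) Let $\alpha, T > 0$ and $b, d \ge 0$, let $C$ be a compact subset of $\mathbb{R}_+^*$, and write $\mathbf{P}^K_z = \mathbf{P}^K(b, d, \alpha, z)$ for $z \in \mathbb{N}/K$. Let $\phi_z$ denote the solution to

$$\dot\phi = (b - d - \alpha\phi)\,\phi \tag{17}$$

with initial condition $\phi_z(0) = z$. Then

$$r := \inf_{z \in C}\, \inf_{0 \le t \le T} |\phi_z(t)| > 0 \quad \text{and} \quad R := \sup_{z \in C}\, \sup_{0 \le t \le T} |\phi_z(t)| < +\infty.$$

Moreover, for any $\delta < r$,

$$\lim_{K \to +\infty}\, \sup_{z \in C}\, \mathbf{P}^K_z\Big( \sup_{0 \le t \le T} |w_t - \phi_z(t)| \ge \delta \Big) = 0, \tag{18}$$

where $w_t$ is the canonical process on $\mathbb{D}(\mathbb{R}_+, \mathbb{R})$.

(b) Let $T, \alpha_{ij} > 0$ and $b_i, d_i \ge 0$ ($i, j \in \{1,2\}$), let $C$ be a compact subset of $(\mathbb{R}_+^*)^2$, and write $\mathbf{Q}^K_{z_1,z_2} = \mathbf{Q}^K(b_1, b_2, d_1, d_2, \alpha_{11}, \alpha_{12}, \alpha_{21}, \alpha_{22}, z_1, z_2)$ for $z_1$ and $z_2$ in $\mathbb{N}/K$. Let $\phi_{z_1,z_2} = (\phi^1_{z_1,z_2}, \phi^2_{z_1,z_2})$ denote the solution to

$$\begin{cases} \dot\phi^1 = (b_1 - d_1 - \alpha_{11}\phi^1 - \alpha_{12}\phi^2)\,\phi^1 \\ \dot\phi^2 = (b_2 - d_2 - \alpha_{21}\phi^1 - \alpha_{22}\phi^2)\,\phi^2 \end{cases}$$

with initial conditions $\phi^1_{z_1,z_2}(0) = z_1$ and $\phi^2_{z_1,z_2}(0) = z_2$. Then

$$r := \inf_{z \in C}\, \inf_{0 \le t \le T} \|\phi_{z_1,z_2}(t)\| > 0 \quad \text{and} \quad \sup_{z \in C}\, \sup_{0 \le t \le T} \|\phi_{z_1,z_2}(t)\| < +\infty. \tag{19}$$

Moreover, for any $\delta < r$,

$$\lim_{K \to +\infty}\, \sup_{z \in C}\, \mathbf{Q}^K_{z_1,z_2}\Big( \sup_{0 \le t \le T} \|w_t - \phi_{z_1,z_2}(t)\| \ge \delta \Big) = 0,$$

where $w_t = (w^1_t, w^2_t)$ is the canonical process on $\mathbb{D}(\mathbb{R}_+, \mathbb{R}^2)$.

(c) Let $b, \alpha > 0$ and $0 \le d < b$. Observe that $(b - d)/\alpha$ is the unique stable steady state of (17). Fix $0 < \eta_1 < (b - d)/\alpha$ and $\eta_2 > 0$, and define on $\mathbb{D}(\mathbb{R}_+, \mathbb{R})$

$$T^K = \inf\Big\{ t \ge 0 : w_t \notin \Big] \frac{b-d}{\alpha} - \eta_1,\ \frac{b-d}{\alpha} + \eta_2 \Big[ \Big\}.$$

Then, there exists $V > 0$ such that, for any compact subset $C$ of $](b - d)/\alpha - \eta_1, (b - d)/\alpha + \eta_2[$,

$$\lim_{K \to +\infty}\, \sup_{z \in C}\, \mathbf{P}^K_z\big(T^K < e^{KV}\big) = 0. \tag{20}$$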

Proof of (a) and (b) Observe that any solution to (17) with positive initial condition is bounded ($\dot\phi < 0$ as soon as $\phi > (b - d)/\alpha$). This implies that $R < \infty$. Moreover, a solution to (17) can be written as

$$\phi(t) = \phi(0) \exp\Big( \int_0^t (b - d - \alpha\phi(s))\, ds \Big) \ge \phi(0) \exp\big((b - d - \alpha R)t\big),$$

which implies that $r > 0$.

Equation (18) is a consequence of large deviations estimates for the sequence of laws $(\mathbf{P}^K_z)_{K \ge 1}$. As can be seen in Theorem 10.2.6 in Chap. 10 of Dupuis and Ellis (1997), a large deviations principle on $[0,T]$ with a good rate function $I_T$ holds for $\mathbb{Z}/K$-valued Markov jump processes with transition rates

$K p(i/K)$ from $i/K$ to $(i+1)/K$,

$K q(i/K)$ from $i/K$ to $(i-1)/K$,

where $p$ and $q$ are functions defined on $\mathbb{R}$ and with positive values, bounded, Lipschitz and uniformly bounded away from 0. The rate function $I_T$ writes

$$I_T(\phi) = \begin{cases} \displaystyle\int_0^T L(\phi(t), \dot\phi(t))\, dt & \text{if } \phi \text{ is absolutely continuous on } [0,T] \\ +\infty & \text{otherwise} \end{cases} \tag{21}$$

for some function $L : \mathbb{R}^2 \to \mathbb{R}_+$ such that $L(y, z) = 0$ if and only if $z = p(y) - q(y)$. Therefore, $I_T(\phi) = 0$ if and only if $\phi$ is absolutely continuous and

$$\dot\phi = p(\phi) - q(\phi). \tag{22}$$

Moreover, this large deviations principle is uniform with respect to the initial condition. This means that, if $\mathbf{R}^K_z$ denotes the law of this process with initial condition $z$, for any compact set $C \subset \mathbb{R}$, for any closed set $F$ and any open set $G$ of $\mathbb{D}([0,T], \mathbb{R})$,

$$\liminf_{K \to +\infty} \frac{1}{K} \log \inf_{z \in C} \mathbf{R}^K_z(G) \ge - \sup_{z \in C}\ \inf_{\psi \in G,\ \psi(0) = z} I_T(\psi) \tag{23}$$

$$\text{and} \quad \limsup_{K \to +\infty} \frac{1}{K} \log \sup_{z \in C} \mathbf{R}^K_z(F) \le - \inf_{\psi \in F,\ \psi(0) \in C} I_T(\psi). \tag{24}$$

Our birth and death process does not satisfy these assumptions. However, if we define

$$p(z) = b\,\chi(z) \quad \text{and} \quad q(z) = d\,\chi(z) + \alpha\,\chi(z)^2,$$

where $\chi(z) = z$ if $z \in [r - \delta, R + \delta]$; $r - \delta$ if $z < r - \delta$; $R + \delta$ if $z > R + \delta$,
then $\mathbf{R}^K_z = \mathbf{P}^K_z$ on the time interval $[0, \tau]$, where $\tau = \inf\{t \ge 0 : w_t \notin [r - \delta, R + \delta]\}$, and $p$ and $q$ satisfy the assumptions above. Therefore, by (24),

$$\limsup_{K \to +\infty} \frac{1}{K} \log \sup_{z \in C} \mathbf{P}^K_z\Big( \sup_{0 \le t \le T} |w_t - \phi_z(t)| \ge \delta \Big) \le - \inf_{\psi \in F_\delta} I_T(\psi), \quad \text{where}$$

$$F_\delta := \big\{ \psi \in \mathbb{D}([0,T], \mathbb{R}) : \psi(0) \in C \text{ and } \exists t \in [0,T],\ |\psi(t) - \phi_{\psi(0)}(t)| \ge \delta \big\}.$$

By the continuity of the flow of (22) (which is a classical consequence of the fact that $z \mapsto p(z) - q(z)$ is Lipschitz and of Gronwall's Lemma), the set $F_\delta$ is closed. Since $I_T$ is a good rate function, the infimum of $I_T$ over this set is attained at some function belonging to $F_\delta$, which cannot be a solution to (22), and thus is non-zero. This ends the proof of (18).

The proof of (b) can be made in a very similar way. □
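
The law of large numbers behind (18) can be observed on simulated paths. The following is a minimal sketch, assuming constant rates and $b \ne d$; the function names, parameter values and seed are hypothetical and not taken from the paper. It simulates one path with law $\mathbf{P}^K(b, d, \alpha, n_0/K)$ and returns its uniform distance on $[0,T]$ to the closed-form solution of the logistic equation (17).

```python
import math
import random


def logistic_flow(z0, t, b, d, alpha):
    """Closed-form solution of (17) started at z0 (assumes b != d)."""
    r = b - d
    return r * z0 / (alpha * z0 + (r - alpha * z0) * math.exp(-r * t))


def sup_deviation(K, n0, b, d, alpha, T, seed=0):
    """Simulate one path of the rescaled birth-death process on [0, T] and
    return sup_t |Z_t - phi_{z0}(t)| with z0 = n0 / K."""
    rng = random.Random(seed)
    z0, n, t, dev = n0 / K, n0, 0.0, 0.0
    while t < T:
        birth = b * n
        death = (d + alpha * n / K) * n
        total = birth + death
        t_next = T if total == 0.0 else min(T, t + rng.expovariate(total))
        # Z is constant on [t, t_next) and phi is monotone there,
        # so the supremum on this interval is reached at an endpoint.
        for s in (t, t_next):
            dev = max(dev, abs(n / K - logistic_flow(z0, s, b, d, alpha)))
        if t_next < T:
            n += 1 if rng.random() * total < birth else -1
        t = t_next
    return dev
```

Calling `sup_deviation` for increasing values of `K` (with `n0` proportional to `K`) typically gives deviations of order $1/\sqrt{K}$, in line with the convergence stated in (18).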


Proof of (c) Define the function $\chi$ on $\mathbb{R}$ by $\chi(z) = z$ if $z \in [(b - d)/\alpha - \eta_1, (b - d)/\alpha + \eta_2]$, $\chi(z) = (b - d)/\alpha - \eta_1$ for $z < (b - d)/\alpha - \eta_1$ and $\chi(z) = (b - d)/\alpha + \eta_2$ for $z > (b - d)/\alpha + \eta_2$. As in the proof of (a), we can construct from the functions $p(z) = b\chi(z)$ and $q(z) = d\chi(z) + \alpha\chi(z)^2$ a family of laws $(\mathbf{R}^K_z)$ such that $\mathbf{R}^K_z = \mathbf{P}^K_z$ on the time interval $[0, T^K]$, and such that (23) and (24) hold for the good rate function $I_T$ defined in (21).

Observe that any solution to (22) is monotonic and converges to $(b - d)/\alpha$ when $t \to +\infty$. Therefore, the following estimates for the time of exit from an attracting domain are classical (Freidlin and Wentzell, 1984, Chap. 5, Section 4): there exists $V \ge 0$ such that, for any $\delta > 0$,

$$\lim_{K \to +\infty}\, \inf_{z \in C}\, \mathbf{R}^K_z\big( e^{K(V - \delta)} < T^K < e^{K(V + \delta)} \big) = 1,$$

which implies (20) if we can prove that $V > 0$.

The constant $V$ is obtained as follows (see Freidlin and Wentzell, 1984, pp. 108-109): for any $y, z \in \mathbb{R}$, define

$$V(y, z) := \inf_{t > 0,\ \varphi(0) = y,\ \varphi(t) = z} I_t(\varphi).$$

Then

$$V := V\Big( \frac{b-d}{\alpha},\ \frac{b-d}{\alpha} - \eta_1 \Big) \wedge V\Big( \frac{b-d}{\alpha},\ \frac{b-d}{\alpha} + \eta_2 \Big).$$

Now, Theorem 5.4.3 of Freidlin and Wentzell (1984) states that, for any $y, z \in \mathbb{R}$, the infimum defining $V(y, z)$ is attained at some function $\varphi$ linking $y$ to $z$, in the sense that either there exists an absolutely continuous function $\varphi$ defined on $[0, T]$ for some $T > 0$ such that $\varphi(0) = y$, $\varphi(T) = z$ and $V(y, z) = I_T(\varphi) = \int_0^T L(\varphi(t), \dot\varphi(t))\, dt$, or there exists an absolutely continuous function $\varphi$ defined on $]-\infty, T]$ for some $T > -\infty$ such that $\lim_{t \to -\infty} \varphi(t) = y$, $\varphi(T) = z$ and $V(y, z) = \int_{-\infty}^T L(\varphi(t), \dot\varphi(t))\, dt$.

Since any solution to (22) is decreasing as long as it stays in $[(b - d)/\alpha, +\infty[$, a function $\varphi$ defined on $[0, T]$ or $]-\infty, T]$ linking $(b - d)/\alpha$ to $(b - d)/\alpha + \eta_2$ cannot be a solution to (22), and thus $V((b - d)/\alpha, (b - d)/\alpha + \eta_2) > 0$. Similarly, $V((b - d)/\alpha, (b - d)/\alpha - \eta_1) > 0$, and so $V > 0$, which concludes the proof of Theorem 3. □


4.3 Some results on branching processes 

When $\alpha = 0$, $\mathbf{P}^K(b, d, 0, z)$ is the law of a binary branching process divided by $K$. Let us give some results on these processes.
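Theorem 4 Let $b, d > 0$. As in Theorem 3, define, for any $K \ge 1$ and any $z \in \mathbb{N}/K$, $\mathbf{P}^K_z = \mathbf{P}^K(b, d, 0, z)$. Define also, for any $\rho \in \mathbb{R}$, on $\mathbb{D}(\mathbb{R}_+, \mathbb{R})$, the stopping time

$$T_\rho = \inf\{t \ge 0 : w_t = \rho\}.$$

Finally, let $(t_K)_{K \ge 1}$ be a sequence of positive numbers such that $\log K \ll t_K$.

(a) If $b < d$ (sub-critical case), for any $\varepsilon > 0$,

$$\lim_{K \to +\infty} \mathbf{P}^K_{1/K}\big(T_0 \le t_K \wedge T_{\lceil \varepsilon K \rceil / K}\big) = 1, \tag{25}$$

$$\text{and} \quad \lim_{K \to +\infty} \mathbf{P}^K_{\lfloor \varepsilon K \rfloor / K}\big(T_0 \le t_K\big) = 1. \tag{26}$$

Moreover, for any $K \ge 1$, $k \ge 1$ and $n \ge 1$,

$$\mathbf{P}^K_{n/K}\big(T_{kn/K} \le T_0\big) \le \frac{1}{k}. \tag{27}$$

(b) If $b > d$ (super-critical case), for any $\varepsilon > 0$,

$$\lim_{K \to +\infty} \mathbf{P}^K_{1/K}\big(T_0 \le t_K \wedge T_{\lceil \varepsilon K \rceil / K}\big) = \frac{d}{b} \tag{28}$$

$$\text{and} \quad \lim_{K \to +\infty} \mathbf{P}^K_{1/K}\big(T_{\lceil \varepsilon K \rceil / K} \le t_K\big) = 1 - \frac{d}{b}. \tag{29}$$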


Proof Let us denote by $\mathbf{Q}_n$ the law of the binary branching process with initial state $n \in \mathbb{N}$, with individual birth rate $b$ and individual death rate $d$. Then (25), (26), (27), (28) and (29) rewrite respectively

$$\lim_{K \to +\infty} \mathbf{Q}_1\big(T_0 \le t_K \wedge T_{\lceil \varepsilon K \rceil}\big) = 1, \tag{30}$$

$$\lim_{K \to +\infty} \mathbf{Q}_{\lfloor \varepsilon K \rfloor}\big(T_0 \le t_K\big) = 1, \tag{31}$$

$$\mathbf{Q}_n\big(T_{kn} \le T_0\big) \le \frac{1}{k}, \tag{32}$$

$$\lim_{K \to +\infty} \mathbf{Q}_1\big(T_0 \le t_K \wedge T_{\lceil \varepsilon K \rceil}\big) = \frac{d}{b} \tag{33}$$

$$\text{and} \quad \lim_{K \to +\infty} \mathbf{Q}_1\big(T_{\lceil \varepsilon K \rceil} \le t_K\big) = 1 - \frac{d}{b}. \tag{34}$$

The limit (31) follows easily from the distribution of the extinction time for binary branching processes when $b \ne d$ (cf. Athreya and Ney, 1972, p. 109): for any $t \ge 0$ and $n \in \mathbb{N}$,

$$\mathbf{Q}_n(T_0 \le t) = \left( \frac{d\,(1 - e^{-(b-d)t})}{b - d\, e^{-(b-d)t}} \right)^n. \tag{35}$$

Since $t_K \to +\infty$, $\mathbf{Q}_1(T_0 \le t_K \wedge T_{\lceil \varepsilon K \rceil}) \to \mathbf{Q}_1(T_0 < \infty)$, which gives (30) and (33) (the probability of extinction of a binary branching process can be recovered easily from (35)).

The inequality (32) follows from the fact that, if $(Z_t, t \ge 0)$ is a process with law $\mathbf{Q}_n$, $(Z_t \exp(-(b - d)t), t \ge 0)$ is a martingale (cf. Athreya and Ney, 1972, p. 111). Then, Doob's stopping theorem applied to the stopping time $T_0 \wedge T_{kn}$ yields

$$\mathbf{E}_n\big( kn\, e^{(d-b)T_{kn}}\, \mathbf{1}_{\{T_{kn} < T_0\}} \big) = n,$$

where $\mathbf{E}_n$ is the expectation with respect to $\mathbf{Q}_n$. Therefore, when $b < d$, $kn\, \mathbf{Q}_n(T_{kn} < T_0) \le n$, and the proof of (32) is completed.

The limit (34) follows from the fact that, if $(Z_t, t \ge 0)$ is a branching process with law $\mathbf{Q}_1$, the martingale $(Z_t \exp(-(b - d)t), t \ge 0)$ converges a.s. when $t \to +\infty$ to a random variable $W$, where $W = 0$ on the event $\{T_0 < \infty\}$ and $W > 0$ on the event $\{T_0 = \infty\}$ (cf. Athreya and Ney, 1972, p. 112). Hence, on the event $\{T_0 = \infty\}$, when $b > d$,

$$(\log Z_t)/t \to b - d > 0.$$

Therefore, since $\log K \ll t_K$, for any $\varepsilon > 0$, $\mathbf{Q}_1(T_0 = \infty,\ T_{\lceil \varepsilon K \rceil} \ge t_K) \to 0$ when $K \to +\infty$. Then, (34) follows from the fact that $\mathbf{Q}_1(T_0 = \infty) = 1 - d/b$. □
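
The role of the condition $\log K \ll t_K$ can be illustrated numerically with the classical extinction-time distribution of a linear birth and death process. The sketch below is only an illustration under stated assumptions: the rates, the value of $\varepsilon$ and the two choices of $t_K$ are hypothetical, and the function name is ours.

```python
import math


def extinction_by_time(n, t, b, d):
    """Probability that a binary branching process started from n individuals,
    with birth rate b and death rate d (b != d), is extinct by time t."""
    e = math.exp(-(b - d) * t)
    return (d * (1.0 - e) / (b - d * e)) ** n


# Super-critical case: the extinction probability from one individual tends to d/b.
print(extinction_by_time(1, 50.0, 2.0, 1.0))

# Sub-critical case, started from floor(eps*K) individuals.
K, eps, b, d = 10 ** 6, 0.1, 1.0, 2.0
n = math.floor(eps * K)
print(extinction_by_time(n, math.log(K) ** 2, b, d))   # t_K >> log K: close to 1
print(extinction_by_time(n, 0.5 * math.log(K), b, d))  # t_K too short: close to 0
```

With $t_K$ only of order $\log K$ and a small constant, the $\lfloor \varepsilon K \rfloor$ independent lineages do not all have time to die out, which is why the theorem requires $\log K \ll t_K$.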


5 Proof of Theorem 1

Let us assume, without loss of generality, that $\nu^K$ is constructed by (15) on a sufficiently large probability space $(\Omega, \mathcal{F}, \mathbf{P})$.

We introduce the following sequences of stopping times: for all $n \ge 1$, let $\tau_n$ be the first mutation time after time $\tau_{n-1}$, with $\tau_0 = 0$ (i.e. $\tau_n$ is the $n$-th mutation time), and for any $n \ge 0$, let $\theta_n$ be the first time after $\tau_n$ when the population gets monomorphic. Observe that $\theta_0 = 0$ if the initial population is monomorphic. For any $n \ge 1$, define the random variable $U_n$ as the new trait value appearing at the mutation time $\tau_n$, and, when $\theta_n < \infty$, define $V_n$ by $\mathrm{Supp}(\nu^K_{\theta_n}) = \{V_n\}$. When $\theta_n = +\infty$, define $V_n = +\infty$.

Our proof of Theorem 1 is based on the following two lemmas. The first lemma proves that there is no accumulation of mutations on the time scale of Theorem 1 and studies the asymptotic behavior of $\tau_1$ starting from a monomorphic population, when $K \to +\infty$.


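Lemma 2

(a) Assume that the initial condition of $\nu^K$ satisfies $\sup_K \mathbf{E}(\langle \nu_0^K, \mathbf{1}\rangle) < +\infty$. Then, for any $\eta > 0$, there exists $\varepsilon > 0$ such that, for any $t \ge 0$,

$$\limsup_{K \to +\infty}\, \mathbf{P}^K_{\nu_0^K}\Big( \exists n \ge 0 : \frac{t}{K u_K} \le \tau_n \le \frac{t + \varepsilon}{K u_K} \Big) < \eta. \tag{36}$$

Let $x \in \mathcal{X}$ and let $(z_K)_{K \ge 1}$ be a sequence of integers such that $z_K/K \to z > 0$.

(b) For any $\varepsilon > 0$,

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big( \tau_1 > \log K,\ \sup_{t \in [\log K, \tau_1]} |\langle \nu^K_t, \mathbf{1}\rangle - \bar n_x| > \varepsilon \Big) = 0. \tag{37}$$

Since $\log K \ll 1/(K u_K)$, by (a) with $t = 0$,

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}(\tau_1 < \log K) = 0.$$

In particular, under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$, $\nu^K_{\log K} \to \bar n_x \delta_x$ and $\nu^K_{\tau_1 -} \to \bar n_x \delta_x$ in probability.

If, moreover, $z = \bar n_x$, then, for any $\varepsilon > 0$,

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big( \sup_{t \in [0, \tau_1]} |\langle \nu^K_t, \mathbf{1}\rangle - \bar n_x| > \varepsilon \Big) = 0. \tag{38}$$

(c) For any $t > 0$,

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big( \tau_1 > \frac{t}{K u_K} \Big) = \exp(-\beta(x) t),$$

where $\beta(\cdot)$ has been defined earlier.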

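The second lemma studies the asymptotic behavior of $\theta_0$ and $V_0$ starting from a dimorphic population, when $K \to +\infty$.

Lemma 3 Fix $x, y \in \mathcal{X}$ satisfying one of the two conditions of assumption (B), and let $(z_K)_{K \ge 1}$ be a sequence of integers such that $z_K/K \to \bar n_x$. Then,

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x + \frac{1}{K}\delta_y}(V_0 = y) = \frac{[f(y,x)]_+}{b(y)}, \tag{39}$$

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x + \frac{1}{K}\delta_y}(V_0 = x) = 1 - \frac{[f(y,x)]_+}{b(y)}, \tag{40}$$

$$\forall \eta > 0, \quad \lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x + \frac{1}{K}\delta_y}\Big( \theta_0 > \frac{\eta}{K u_K} \wedge \tau_1 \Big) = 0 \tag{41}$$

$$\text{and} \quad \forall \varepsilon > 0, \quad \lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x + \frac{1}{K}\delta_y}\big( |\langle \nu^K_{\theta_0}, \mathbf{1}\rangle - \bar n_{V_0}| < \varepsilon \big) = 1, \tag{42}$$

where $f(y, x)$ has been defined earlier.

Observe that (41) implies in particular that

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x + \frac{1}{K}\delta_y}(\theta_0 < \tau_1) = 1.$$

The proofs of these lemmas are postponed to the end of this section.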


Proof of Theorem 1 Observe that the generator $A$ of the TSS process $(X_t, t \ge 0)$ of Theorem 1 can be written as

$$A\varphi(x) = \int_{\mathbb{R}^l} (\varphi(x + h) - \varphi(x))\, \beta(x)\, \kappa(x, dh), \tag{43}$$

where the probability measure $\kappa(x, dh)$ is defined by

$$\kappa(x, dh) = \Big( 1 - \int_{\mathbb{R}^l} \frac{[f(x + v, x)]_+}{b(x + v)}\, m(x, v)\, dv \Big)\, \delta_0(dh) + \frac{[f(x + h, x)]_+}{b(x + h)}\, m(x, h)\, dh. \tag{44}$$

This means that the TSS model $X$ with initial state $x$ can be constructed as follows: let $(Z(k), k = 0, 1, 2, \ldots)$ be a Markov chain in $\mathcal{X}$ with initial state $x$ and with transition kernel $\kappa(x, dh)$, and let $(N(t), t \ge 0)$ be an independent standard Poisson process. Then, the process $(X_t, t \ge 0)$ defined by

$$X_t := Z\Big( N\Big( \int_0^t \beta(X_s)\, ds \Big) \Big)$$

is a Markov process with infinitesimal generator (43) (cf. Ethier and Kurtz, 1986, Chap. 6). Let $\mathbf{P}_x$ denote its law, let $(T_n)_{n \ge 1}$ denote the sequence of jump times of the Poisson process $N$ and define $(S_n)_{n \ge 1}$ by $T_n = \int_0^{S_n} \beta(X_s)\, ds$. By (A1) and (A3), $\beta(\cdot) > 0$, and so $S_n$ is finite for any $n \ge 1$. Observe that any jump of the process $X$ occurs at some time $S_n$, but that all $S_n$ may not be effective jump times for $X$, because of the Dirac mass at 0 appearing in (44).
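
The construction above translates directly into a simulation scheme: wait an exponential time with rate $\beta(x)$, draw a mutation step $h$ from $m(x, h)dh$, and accept it with probability $[f(x+h, x)]_+/b(x+h)$; a rejection corresponds to the Dirac mass at 0 in (44). The following is a minimal sketch under that reading. The function name and its callable arguments are ours, and the toy model at the bottom (constant $\beta$, uniform mutation steps, an ad hoc fixation probability) is purely hypothetical and only meant to make the code runnable.

```python
import random


def simulate_tss(x0, t_max, beta, fixation_prob, draw_mutation_step, seed=0):
    """Simulate the jump process with generator (43) up to time t_max.

    beta(x): rate of the exponential clock in state x.
    fixation_prob(y, x): probability in [0, 1] that a mutant y replaces x,
        playing the role of [f(y, x)]_+ / b(y).
    draw_mutation_step(x, rng): a sample h from the mutation law m(x, h) dh.
    Returns the list of (time, trait) pairs at effective jumps.
    """
    rng = random.Random(seed)
    t, x = 0.0, x0
    path = [(t, x)]
    while True:
        t += rng.expovariate(beta(x))        # next time S_n
        if t > t_max:
            return path
        h = draw_mutation_step(x, rng)
        if rng.random() < fixation_prob(x + h, x):
            x = x + h                        # effective jump of X
            path.append((t, x))
        # otherwise the chain Z stays put (Dirac mass at 0 in (44))


# Hypothetical one-dimensional toy model: only larger traits can invade.
toy_path = simulate_tss(
    0.0, 50.0,
    beta=lambda x: 1.0,
    fixation_prob=lambda y, x: max(0.0, y - x) / (1.0 + abs(y - x)),
    draw_mutation_step=lambda x, rng: rng.uniform(-0.1, 0.1),
    seed=3,
)
```

In this toy model the trait can only increase, since mutants with a smaller trait have fixation probability zero.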

Fix $t > 0$, $x \in \mathcal{X}$ and a measurable subset $\Gamma$ of $\mathcal{X}$. Under $\mathbf{P}_x$, $S_1$ and $X_{S_1}$ are independent, $S_1$ is an exponential random variable with parameter $\beta(x)$, and $X_{S_1} - x$ has law $\kappa(x, \cdot)$. Therefore, for any $n \ge 1$, the strong Markov property applied to $X$ at time $S_1$ yields

$$\mathbf{P}_x(S_n \le t < S_{n+1},\ X_t \in \Gamma) = \int_0^t \beta(x) e^{-\beta(x)s} \int_{\mathbb{R}^l} \mathbf{P}_{x+h}(S_{n-1} \le t - s < S_n,\ X_{t-s} \in \Gamma)\, \kappa(x, dh)\, ds. \tag{45}$$

Moreover,

$$\mathbf{P}_x(0 \le t < S_1,\ X_t \in \Gamma) = \mathbf{1}_{\{x \in \Gamma\}}\, e^{-\beta(x)t}. \tag{46}$$

The idea of our proof of Theorem 1 is to show that the same relations hold when we replace $S_n$ by $\tau_n$ and $X_t$ by the support of $\nu^K_{t/K u_K}$ (when it is a singleton) and when $K \to +\infty$.

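More precisely, fix $x \in \mathcal{X}$, $t > 0$ and a measurable subset $\Gamma$ of $\mathcal{X}$, and observe that

$$\big\{ \exists y \in \Gamma : \mathrm{Supp}(\nu^K_{t/K u_K}) = \{y\},\ |\langle \nu^K_{t/K u_K}, \mathbf{1}\rangle - \bar n_y| < \varepsilon \big\} = \bigcup_{n \ge 0} A^K_n(t, \Gamma, \varepsilon), \tag{47}$$

where

$$A^K_n(t, \Gamma, \varepsilon) := \Big\{ \theta_n \le \frac{t}{K u_K} < \tau_{n+1},\ V_n \in \Gamma,\ |\langle \nu^K_{t/K u_K}, \mathbf{1}\rangle - \bar n_{V_n}| < \varepsilon \Big\}.$$

Let us define, for any $z \in \mathbb{N}$ and $n \ge 0$,

$$p^K_n(t, x, \Gamma, \varepsilon, z) := \mathbf{P}^K_{\frac{z}{K}\delta_x}\Big( \theta_n \le \frac{t}{K u_K} < \tau_{n+1},\ V_n \in \Gamma,\ \sup_{s \in [\theta_n, \tau_{n+1}]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_n}| < \varepsilon \Big)$$

and

$$q^K_0(t, x, \Gamma, \varepsilon, z) := \mathbf{P}^K_{\frac{z}{K}\delta_x}\Big( \frac{t}{K u_K} < \tau_1,\ V_0 \in \Gamma,\ \sup_{s \in [\log K, \tau_1]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_0}| < \varepsilon \Big)$$

$$= \mathbf{1}_{\{x \in \Gamma\}}\, \mathbf{P}^K_{\frac{z}{K}\delta_x}\Big( \frac{t}{K u_K} < \tau_1,\ \sup_{s \in [\log K, \tau_1]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_x| < \varepsilon \Big).$$

Let us also extend these definitions to $\varepsilon = \infty$ by suppressing the condition involving the supremum of $|\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_n}|$.
Then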

Lemma 4

(a) For any $x \in \mathcal{X}$, $n \ge 1$, $t > 0$, $\varepsilon \in ]0, \infty]$ and for any sequence of integers $(z_K)$ such that $z_K/K \to z > 0$, $p_n(t, x, \Gamma) := \lim_{K \to +\infty} p^K_n(t, x, \Gamma, \varepsilon, z_K)$ exists, and is independent of $(z_K)$, $z$ and $\varepsilon$.

Similarly, $p_0(t, x, \Gamma) := \lim_{K \to +\infty} q^K_0(t, x, \Gamma, \varepsilon, z_K)$ exists, and is independent of $(z_K)$, $z$ and $\varepsilon$, and, if $z = \bar n_x$, $\lim_{K \to +\infty} p^K_0(t, x, \Gamma, \varepsilon, z_K)$ exists and is also equal to $p_0(t, x, \Gamma)$.

Finally, if we assume that $(z_K)$ is a sequence of $\mathbb{N}$-valued random variables such that $z_K/K$ converges in probability to a deterministic $z > 0$, then the limits above hold in probability (with the same restriction that $z$ has to be equal to $\bar n_x$ for $p^K_0$).

(b) The functions $p_n(t, x, \Gamma)$ are continuous with respect to $t$ and measurable with respect to $x$, and satisfy

$$p_0(t, x, \Gamma) = \mathbf{1}_{\{x \in \Gamma\}}\, e^{-\beta(x)t} \quad \text{and} \quad \forall n \ge 0,$$

$$p_{n+1}(t, x, \Gamma) = \int_0^t \beta(x) e^{-\beta(x)s} \int_{\mathbb{R}^l} p_n(t - s, x + h, \Gamma)\, \kappa(x, dh)\, ds. \tag{48}$$

Let us postpone the proof of this lemma after the proof of Theorem 1.

Observe that, because of (45) and (46), Lemma 4 (b) implies that $\mathbf{P}_x(S_n \le t < S_{n+1},\ X_t \in \Gamma) = p_n(t, x, \Gamma)$.
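
As a sanity check of the recursion (48), consider the simplified situation, assumed here only for illustration, where $\beta$ is a constant and $\Gamma$ is the whole trait space, so that $p_n$ no longer depends on $x$ and the integral against $\kappa(x, dh)$ disappears. The recursion then reduces to a convolution whose solution is the Poisson probability $e^{-\beta t}(\beta t)^n/n!$, i.e. the law of the number of clock rings $S_n$ before time $t$. The sketch below (function name and grid parameters are ours) applies this reduced recursion on a uniform time grid.

```python
import math


def p_next_on_grid(p_vals, beta, dt):
    """One step of the x-independent version of (48) on the grid t_j = j*dt:
    q(t_j) = int_0^{t_j} beta * exp(-beta*s) * p(t_j - s) ds (trapezoidal rule)."""
    out = []
    for j in range(len(p_vals)):
        if j == 0:
            out.append(0.0)
            continue
        vals = [beta * math.exp(-beta * i * dt) * p_vals[j - i] for i in range(j + 1)]
        out.append(dt * (sum(vals) - 0.5 * (vals[0] + vals[-1])))
    return out


beta, dt = 1.5, 0.01
grid = [j * dt for j in range(201)]
p0 = [math.exp(-beta * t) for t in grid]   # p_0(t) = exp(-beta t)
p1 = p_next_on_grid(p0, beta, dt)          # expected: beta t exp(-beta t)
p2 = p_next_on_grid(p1, beta, dt)          # expected: (beta t)^2 / 2 * exp(-beta t)
```

In the general case the inner integral against $\kappa(x, dh)$ couples different trait values, and no closed form is available.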

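Now, let $\mathbf{P}^K_\nu$ denote the law of the process $\nu^K$ with random initial state $\nu$. Since $\nu^K$ is Markov, $\mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x} = \mathbf{E}\big[\mathbf{P}^K_{\frac{\gamma_K(\omega)}{K}\delta_x}\big]$. By (47),

$$\mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\Big( \exists y \in \Gamma : \mathrm{Supp}(\nu^K_{t/K u_K}) = \{y\},\ |\langle \nu^K_{t/K u_K}, \mathbf{1}\rangle - \bar n_y| < \varepsilon \Big) = \sum_{n \ge 0} \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_n(t, \Gamma, \varepsilon)\big),$$

where $(\gamma_K)$ is the sequence of $\mathbb{N}$-valued random variables of Theorem 1.

For any $K \ge 1$ and $n \ge 1$,

$$p^K_n(t, x, \Gamma, \varepsilon, \gamma_K) \le \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_n(t, \Gamma, \varepsilon)\big) \le p^K_n(t, x, \Gamma, \infty, \gamma_K),$$

$$\text{and} \quad q^K_0(t, x, \Gamma, \varepsilon, \gamma_K) \le \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_0(t, \Gamma, \varepsilon)\big) \le p^K_0(t, x, \Gamma, \infty, \gamma_K),$$

so, by Lemma 4 (a), for any $n \ge 0$, $\mathbf{P}^K_{\frac{\gamma_K(\omega)}{K}\delta_x}(A^K_n(t, \Gamma, \varepsilon)) \to p_n(t, x, \Gamma)$ in probability, and therefore,

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_n(t, \Gamma, \varepsilon)\big) = p_n(t, x, \Gamma). \tag{49}$$

Now, by (47), for any $K \ge 1$,

$$\sum_{n=0}^{+\infty} \Big[ \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_n(t, \Gamma, \varepsilon)\big) + \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_n(t, \Gamma^c, \varepsilon)\big) \Big] \le 1, \tag{50}$$

where $\Gamma^c$ denotes the complement of $\Gamma$. Moreover, $\sum_{n=0}^{\infty} [p_n(t, x, \Gamma) + p_n(t, x, \Gamma^c)] = 1$. Therefore, for any $\eta > 0$, there exists $n_0$ such that

$$\sum_{n=0}^{n_0} [p_n(t, x, \Gamma) + p_n(t, x, \Gamma^c)] \ge 1 - \eta.$$

Then, one can easily deduce from (49) and (50) that

$$\limsup_{K \to +\infty} \sum_{n > n_0} \Big[ \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_n(t, \Gamma, \varepsilon)\big) + \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\big(A^K_n(t, \Gamma^c, \varepsilon)\big) \Big] \le \eta,$$

from which follows, by (47), that

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{\gamma_K}{K}\delta_x}\Big( \exists y \in \Gamma : \mathrm{Supp}(\nu^K_{t/K u_K}) = \{y\},\ |\langle \nu^K_{t/K u_K}, \mathbf{1}\rangle - \bar n_y| < \varepsilon \Big) = \sum_{n \ge 0} p_n(t, x, \Gamma) = \mathbf{P}_x(X_t \in \Gamma),$$

which is the convergence stated in Theorem 1 in the case of a single time $t$.

In order to complete the proof of Theorem 1, we have to generalize this limit to any sequence of times $0 < t_1 < \ldots < t_n$.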
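We will specify the method only in the case of two times $0 < t_1 < t_2$. It can be easily generalized to a sequence of $n$ times. We introduce for any integers $0 \le n_1 \le n_2$ the probabilities

$$p^K_{n_1,n_2}(t_1, t_2, x, \Gamma_1, \Gamma_2, \varepsilon, z) := \mathbf{P}^K_{\frac{z}{K}\delta_x}\Big( \theta_{n_1} \le \frac{t_1}{K u_K} < \tau_{n_1+1},\ V_{n_1} \in \Gamma_1,\ \sup_{s \in [\theta_{n_1}, \tau_{n_1+1}]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_{n_1}}| < \varepsilon,$$

$$\theta_{n_2} \le \frac{t_2}{K u_K} < \tau_{n_2+1},\ V_{n_2} \in \Gamma_2 \text{ and } \sup_{s \in [\theta_{n_2}, \tau_{n_2+1}]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_{n_2}}| < \varepsilon \Big)$$

and

$$q^K_{0,n_2}(t_1, t_2, x, \Gamma_1, \Gamma_2, \varepsilon, z) := \mathbf{1}_{\{x \in \Gamma_1\}}\, \mathbf{P}^K_{\frac{z}{K}\delta_x}\Big( \frac{t_1}{K u_K} < \tau_1,\ \sup_{s \in [\log K, \tau_1]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_x| < \varepsilon,$$

$$\theta_{n_2} \le \frac{t_2}{K u_K} < \tau_{n_2+1},\ V_{n_2} \in \Gamma_2 \text{ and } \sup_{s \in [\theta_{n_2}, \tau_{n_2+1}]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_{n_2}}| < \varepsilon \Big).$$

Then, we can use a calculation very similar to the proof of Lemma 4 to prove that, as $K \to +\infty$, $p^K_{n_1,n_2}(t_1, t_2, x, \Gamma_1, \Gamma_2, \varepsilon, z_K)$ converges to a limit $p_{n_1,n_2}(t_1, t_2, x, \Gamma_1, \Gamma_2)$ independent of $\varepsilon \in ]0, \infty]$, $z_K$ and the limit $z > 0$ of $z_K/K$ (with the restriction that $z$ has to be equal to $\bar n_x$ if $n_1 = 0$), and that $\lim q^K_{0,n_2}(t_1, t_2, x, \Gamma_1, \Gamma_2, \varepsilon, z_K) = p_{0,n_2}(t_1, t_2, x, \Gamma_1, \Gamma_2)$, where

$$p_{0,n_2}(t_1, t_2, x, \Gamma_1, \Gamma_2) = \mathbf{1}_{\{x \in \Gamma_1\}}\, e^{-\beta(x)t_1}\, p_{n_2}(t_2 - t_1, x, \Gamma_2);$$

$$p_{n_1+1,n_2+1}(t_1, t_2, x, \Gamma_1, \Gamma_2) = \int_0^{t_1} \beta(x) e^{-\beta(x)s} \int_{\mathbb{R}^l} p_{n_1,n_2}(t_1 - s, t_2 - s, x + h, \Gamma_1, \Gamma_2)\, \kappa(x, dh)\, ds.$$

As above, we obtain the convergence of Theorem 1 for $n = 2$ by observing that the same relation holds for the TSS process $X$.

This completes the proof of Theorem 1. □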


Proof of Lemma 4 First, let us prove that the convergence of $p^K_n(t, x, \Gamma, \varepsilon, z_K)$ when $z_K \in \mathbb{N}$ in Lemma 4 (a) implies the convergence in probability of these quantities when $z_K$ are random variables: if $(z_K)$ is a sequence of random variables such that $z_K/K \to z$ in probability, by Skorohod's Theorem, we can construct on an auxiliary probability space $\hat\Omega$ a sequence of random variables $(\hat z_K)$ such that $\mathcal{L}(\hat z_K) = \mathcal{L}(z_K)$ and $\hat z_K(\hat\omega)/K \to z$ for any $\hat\omega \in \hat\Omega$. Then, $\lim p^K_n(t, x, \Gamma, \varepsilon, \hat z_K(\hat\omega)) = p_n(t, x, \Gamma)$ for any $\hat\omega \in \hat\Omega$, which implies that $p^K_n(t, x, \Gamma, \varepsilon, z_K) \to p_n(t, x, \Gamma)$ in probability. The same method applies to $q^K_0(t, x, \Gamma, \varepsilon, z_K)$.

We will prove Lemma 4 (a) and (b) by induction over $n \ge 0$.

First, when $t > 0$, it follows from the fact that $t/K u_K > \log K$ for sufficiently large $K$, and from Lemma 2 (b) and (c), that

$$\lim_{K \to +\infty} q^K_0(t, x, \Gamma, \varepsilon, z_K) = \mathbf{1}_{\{x \in \Gamma\}}\, e^{-\beta(x)t},$$

and that, if $z = \bar n_x$,

$$\lim_{K \to +\infty} p^K_0(t, x, \Gamma, \varepsilon, z_K) = \mathbf{1}_{\{x \in \Gamma\}}\, e^{-\beta(x)t}.$$

Then, fix $n \ge 0$ and assume that Lemma 4 (a) holds for $n$. We intend to prove the convergence of $p^K_{n+1}(t, x, \Gamma, \varepsilon, z_K)$ to $p_{n+1}(t, x, \Gamma)$ satisfying (48) by applying the strong Markov property at time $\tau_1$, in a similar way as when we obtained (45). However, the convergence of $p^K_n(t, x, \Gamma, \varepsilon, z_K)$ to $p_n(t, x, \Gamma)$ only holds for non-random $t$. Therefore, we will divide the time interval $[0, t]$ into a finite number of small intervals and use the Markov property at time $\tau_1$ when $\tau_1$ is in each of these intervals. Moreover, we will also use the Markov property at time $\theta_1$ and we will use the fact that $U_1$ is independent of $\tau_1$ and $\nu^K_{\tau_1 -}$ and that $U_1 - x$ is a random variable with law $m(x, h)dh$.

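Following this program, we can bound $p^K_{n+1}(t, x, \Gamma, \varepsilon, z_K)$ from above as follows: fix $\eta > 0$; using Lemma 2 (a) in the first inequality, for sufficiently large $k \ge 0$ and $K \ge 1$,

$$p^K_{n+1}(t, x, \Gamma, \varepsilon, z_K) \le \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big( \theta_{n+1} \le \frac{t}{K u_K},\ \tau_{n+2} > \frac{t + 2/2^k}{K u_K},\ V_{n+1} \in \Gamma \Big) + \eta$$

$$\le \sum_{i=0}^{\lceil t 2^k \rceil - 1} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big( \frac{i}{2^k K u_K} \le \tau_1 < \frac{i+1}{2^k K u_K},\ \theta_{n+1} \le \frac{t}{K u_K},\ \tau_{n+2} > \frac{t + 2/2^k}{K u_K} \text{ and } V_{n+1} \in \Gamma \Big) + \eta$$

$$\le \sum_{i=0}^{\lceil t 2^k \rceil - 1} \mathbf{E}^K_{\frac{z_K}{K}\delta_x}\Big[ \mathbf{1}_{\left\{ \frac{i}{2^k K u_K} \le \tau_1 < \frac{i+1}{2^k K u_K} \right\}}\, \mathbf{P}^K_{\nu^K_{\tau_1}}\Big( \theta_n \le \frac{t - i/2^k}{K u_K},\ \tau_{n+1} > \frac{t - (i-1)/2^k}{K u_K} \text{ and } V_n \in \Gamma \Big) \Big] + \eta$$

$$\le \sum_{i=0}^{\lceil t 2^k \rceil - 1} \mathbf{E}^K_{\frac{z_K}{K}\delta_x}\Big[ \mathbf{1}_{\left\{ \frac{i}{2^k K u_K} \le \tau_1 < \frac{i+1}{2^k K u_K} \right\}} \int_{\mathbb{R}^l} \mathbf{P}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \theta_n \le \frac{t - i/2^k}{K u_K} < \tau_{n+1},\ V_n \in \Gamma \Big)\, m(x, h)\, dh \Big] + \eta$$

$$\le \sum_{i=0}^{\lceil t 2^k \rceil - 1} \mathbf{E}^K_{\frac{z_K}{K}\delta_x}\Big[ \mathbf{1}_{\left\{ \frac{i}{2^k K u_K} \le \tau_1 < \frac{i+1}{2^k K u_K} \right\}} \int_{\mathbb{R}^l} \Big( \mathbf{P}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \theta_0 > \frac{1}{2^k K u_K} \wedge \tau_1 \Big)$$

$$+\ \mathbf{E}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \mathbf{1}_{\left\{ \theta_0 \le \frac{1}{2^k K u_K} \wedge \tau_1 \right\}}\, p^K_n\big(t - i/2^k, V_0, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big) \Big) \Big)\, m(x, h)\, dh \Big] + \eta. \tag{51}$$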


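Now, since $\nu^K_{\tau_1 -} = \langle \nu^K_{\tau_1 -}, \mathbf{1}\rangle \delta_x$, under $\mathbf{P}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}$, on the event $\{\theta_0 < \tau_1\}$,

$$p^K_n\big(t - i/2^k, V_0, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big) = \mathbf{1}_{\{V_0 = x\}}\, p^K_n\big(t - i/2^k, x, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big)$$

$$+\ \mathbf{1}_{\{V_0 = x+h\}}\, p^K_n\big(t - i/2^k, x + h, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big). \tag{52}$$

By Lemma 2 (b), $\nu^K_{\tau_1 -} \to \bar n_x \delta_x$ in probability under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$, so we can use Skorohod's Theorem to construct random variables $\hat N_K$ on an auxiliary probability space $\hat\Omega$ with the same law as $\langle \nu^K_{\tau_1 -}, \mathbf{1}\rangle$ and converging to $\bar n_x$ for any $\hat\omega \in \hat\Omega$.

Fix $\hat\omega \in \hat\Omega$. Under $\mathbf{P}^K_{\hat N_K(\hat\omega)\delta_x + \frac{1}{K}\delta_{x+h}}$, define

$$Z^K_1 = \langle \nu^K_{\theta_0}, \mathbf{1}\rangle\, \mathbf{1}_{\{V_0 = x,\ \theta_0 < \tau_1\}} + \frac{\lceil K \bar n_x \rceil}{K}\, \mathbf{1}_{\{V_0 \ne x\} \cup \{\theta_0 \ge \tau_1\}}.$$

It follows from Lemma 3 (41) and (42), and from assumption (B) that, for Lebesgue almost every $h$, $Z^K_1 \to \bar n_x$ in probability, so, by the induction assumption, under $\mathbf{P}^K_{\hat N_K(\hat\omega)\delta_x + \frac{1}{K}\delta_{x+h}}$,

$$p^K_n(t - i/2^k, x, \Gamma, \infty, K Z^K_1) \to p_n(t - i/2^k, x, \Gamma) \quad \text{in probability}.$$

Now, given two sequences of uniformly bounded random variables $(X_K)_{K \ge 1}$ and $(Y_K)_{K \ge 1}$ such that $X_K$ and $Y_K$ are defined on the same probability space for any $K \ge 1$, and such that, when $K \to +\infty$, $X_K$ converges in probability to a constant $C$ and $\lim_K \mathbf{E}(Y_K)$ exists, it is standard to prove that

$$\lim_{K \to +\infty} \mathbf{E}(X_K Y_K) = C \lim_{K \to +\infty} \mathbf{E}(Y_K). \tag{53}$$

Applying this with $X_K = p^K_n(t - i/2^k, x, \Gamma, \infty, K Z^K_1)$ and $Y_K = \mathbf{1}_{\{V_0 = x,\ \theta_0 < \tau_1\}}$, by Lemma 3 (40) and (41) and assumption (B), for Lebesgue almost any $h$ and for any $\hat\omega \in \hat\Omega$,

$$\lim_{K \to +\infty} \mathbf{E}^K_{\hat N_K(\hat\omega)\delta_x + \frac{1}{K}\delta_{x+h}}\Big( \mathbf{1}_{\{V_0 = x,\ \theta_0 < \tau_1\}}\, p^K_n\big(t - i/2^k, x, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big) \Big) = \Big( 1 - \frac{[f(x+h, x)]_+}{b(x+h)} \Big)\, p_n(t - i/2^k, x, \Gamma).$$

Finally, we obtain that, for Lebesgue almost any $h$, under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$,

$$\mathbf{E}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \mathbf{1}_{\{V_0 = x,\ \theta_0 < \tau_1\}}\, p^K_n\big(t - i/2^k, x, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big) \Big) \to \Big( 1 - \frac{[f(x+h, x)]_+}{b(x+h)} \Big)\, p_n(t - i/2^k, x, \Gamma) \quad \text{in probability}. \tag{54}$$

Similarly, we can use Lemma 3 (39) and the random variable

$$Z^K_2 = \langle \nu^K_{\theta_0}, \mathbf{1}\rangle\, \mathbf{1}_{\{V_0 = x+h,\ \theta_0 < \tau_1\}} + \bar n_{x+h}\, \mathbf{1}_{\{V_0 \ne x+h\} \cup \{\theta_0 \ge \tau_1\}}$$

to prove that, for Lebesgue almost any $h$, under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$,

$$\mathbf{E}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \mathbf{1}_{\{V_0 = x+h,\ \theta_0 < \tau_1\}}\, p^K_n\big(t - i/2^k, x + h, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big) \Big) \to \frac{[f(x+h, x)]_+}{b(x+h)}\, p_n(t - i/2^k, x + h, \Gamma) \quad \text{in probability}. \tag{55}$$

Moreover, by Lemma 3 (41), for Lebesgue almost any $h$, under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$,

$$\mathbf{P}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \theta_0 > \frac{1}{2^k K u_K} \wedge \tau_1 \Big) \to 0 \quad \text{in probability}. \tag{56}$$

Collecting these results together, applying (53) again, it follows from Lemma 2 (c) and the preceding limits that, for Lebesgue almost any $h$,

$$\lim_{K \to +\infty} \mathbf{E}^K_{\frac{z_K}{K}\delta_x}\Big[ \mathbf{1}_{\left\{ \frac{i}{2^k K u_K} \le \tau_1 < \frac{i+1}{2^k K u_K} \right\}}\, \mathbf{E}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \mathbf{1}_{\left\{ \theta_0 \le \frac{1}{2^k K u_K} \wedge \tau_1 \right\}}\, p^K_n\big(t - i/2^k, V_0, \Gamma, \infty, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big) \Big) \Big]$$

$$= \big( e^{-\beta(x) i/2^k} - e^{-\beta(x)(i+1)/2^k} \big) \Big[ \Big( 1 - \frac{[f(x+h, x)]_+}{b(x+h)} \Big)\, p_n(t - i/2^k, x, \Gamma) + \frac{[f(x+h, x)]_+}{b(x+h)}\, p_n(t - i/2^k, x + h, \Gamma) \Big].$$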

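Finally, taking the integral of both sides with respect to $m(x, h)dh$, the dominated convergence theorem and (51) yield

$$\limsup_{K \to +\infty} p^K_{n+1}(t, x, \Gamma, \varepsilon, z_K) \le \sum_{i=0}^{\lceil t 2^k \rceil - 1} \big( e^{-\beta(x) i/2^k} - e^{-\beta(x)(i+1)/2^k} \big) \int_{\mathbb{R}^l} p_n(t - i/2^k, x + h, \Gamma)\, \kappa(x, dh) + \eta.$$

Taking the limit $k \to +\infty$ first and then $\eta \to 0$, it follows from the fact that

$$e^{-\beta(x) i/2^k} - e^{-\beta(x)(i+1)/2^k} = e^{-\beta(x) i/2^k} \big( \beta(x)/2^k + O(1/2^{2k}) \big)$$

and from the convergence of Riemann sums that

$$\limsup_{K \to +\infty} p^K_{n+1}(t, x, \Gamma, \varepsilon, z_K) \le \int_0^t \beta(x) e^{-\beta(x)s} \int_{\mathbb{R}^l} p_n(t - s, x + h, \Gamma)\, \kappa(x, dh)\, ds.$$

Using the same method as for (51), we can give a lower bound for $p^K_{n+1}$ as follows: for any $\eta > 0$, for sufficiently large $k \ge 0$ and $K \ge 1$,

$$p^K_{n+1}(t, x, \Gamma, \varepsilon, z_K) \ge \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big( \theta_{n+1} \le \frac{t}{K u_K},\ \tau_{n+2} > \frac{t - 2/2^k}{K u_K},\ V_{n+1} \in \Gamma \text{ and } \sup_{s \in [\theta_{n+1}, \tau_{n+2}]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_{n+1}}| < \varepsilon \Big) - \eta$$

$$\ge \sum_{i=0}^{\lfloor t 2^k \rfloor - 3} \mathbf{E}^K_{\frac{z_K}{K}\delta_x}\Big[ \mathbf{1}_{\left\{ \frac{i}{2^k K u_K} \le \tau_1 < \frac{i+1}{2^k K u_K} \right\}}\, \mathbf{P}^K_{\nu^K_{\tau_1}}\Big( \theta_n \le \frac{t - (i+1)/2^k}{K u_K},\ \tau_{n+1} > \frac{t - (i+2)/2^k}{K u_K},\ V_n \in \Gamma \text{ and } \sup_{s \in [\theta_n, \tau_{n+1}]} |\langle \nu^K_s, \mathbf{1}\rangle - \bar n_{V_n}| < \varepsilon \Big) \Big] - \eta$$

$$\ge \sum_{i=0}^{\lfloor t 2^k \rfloor - 3} \mathbf{E}^K_{\frac{z_K}{K}\delta_x}\Big[ \mathbf{1}_{\left\{ \frac{i}{2^k K u_K} \le \tau_1 < \frac{i+1}{2^k K u_K} \right\}} \int_{\mathbb{R}^l} \mathbf{E}^K_{\nu^K_{\tau_1 -} + \frac{1}{K}\delta_{x+h}}\Big( \mathbf{1}_{\left\{ \theta_0 < \frac{1}{2^k K u_K} \wedge \tau_1 \right\}}\, p^K_n\big(t - (i+2)/2^k, V_0, \Gamma, \varepsilon, K\langle \nu^K_{\theta_0}, \mathbf{1}\rangle\big) \Big)\, m(x, h)\, dh \Big] - \eta.$$

Then, as above, letting $K \to +\infty$, then $k \to +\infty$ and finally $\eta \to 0$, we obtain

$$\liminf_{K \to +\infty} p^K_{n+1}(t, x, \Gamma, \varepsilon, z_K) \ge \int_0^t \beta(x) e^{-\beta(x)s} \int_{\mathbb{R}^l} p_n(t - s, x + h, \Gamma)\, \kappa(x, dh)\, ds,$$

which completes the proof of Lemma 4 by induction. □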


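Proof of Lemma 2 (a) Fix $\eta > 0$. By Theorem 2 (a) and (c), for any $K \ge 1$,

$$\langle \nu^K, \mathbf{1}\rangle \preceq Z^K,$$

where $\mathcal{L}(Z^K) = \mathbf{P}^K(2\bar b, 0, \underline{\alpha}, \langle \nu_0^K, \mathbf{1}\rangle + 1)$.

Since $\sup_K \mathbf{E}(\langle \nu_0^K, \mathbf{1}\rangle) < +\infty$, we can choose $M < +\infty$ such that

$$\sup_{K \ge 1} \mathbf{P}(\langle \nu_0^K, \mathbf{1}\rangle + 1 > M) < \eta/3.$$

Then, apply Theorem 3 (c) to $\mathbf{P}^K(2\bar b, 0, \underline{\alpha}, \langle \nu_0^K, \mathbf{1}\rangle + 1)$ with $C = [1, M]$, $\eta_2 = M$ and $\eta_1$ such that $0 < 2\bar b/\underline{\alpha} - \eta_1 < 1/2$: there exists $V > 0$ such that

$$\limsup_{K \to +\infty} \mathbf{P}(T^K < e^{KV}) < \eta/3, \tag{57}$$

where $T^K = \inf\{t \ge 0 : Z^K_t \notin [1/2, M + 2\bar b/\underline{\alpha}]\}$.

Fix $t, \varepsilon > 0$. Since, for $s < T^K$, $\langle \nu^K_s, \mathbf{1}\rangle \le M + 2\bar b/\underline{\alpha}$, if we apply Theorem 2 (b) to the process $(\nu^K_{s + (t/K u_K)}, s \ge 0)$, we obtain, for $s \le T^K - t/K u_K$,

$$A^K_{s + (t/K u_K)} - A^K_{t/K u_K} \preceq B^K_s,$$

where $A^K_s$ is the number of mutations occurring between 0 and $s$, and where $B^K$ is a Poisson process with parameter $K u_K \bar b (M + 2\bar b/\underline{\alpha})$. Therefore, combining (57) with the fact that $1/K u_K \ll e^{KV}$, we obtain that, for sufficiently large $K$,

$$\mathbf{P}\big( A^K_{(t+\varepsilon)/K u_K} - A^K_{t/K u_K} \ge 1 \big) \le \mathbf{P}\big( B^K_{\varepsilon/K u_K} \ge 1 \big) + 2\eta/3 = 1 - \exp\big(-\bar b (M + 2\bar b/\underline{\alpha})\varepsilon\big) + 2\eta/3,$$

which can be made smaller than $\eta$ if $\varepsilon$ is sufficiently small. This ends the proof of (36). □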

Proof of Lemma 121 (b) Fix e > 0. It follows from the construction 11511 of v K 
that, for t < Ti, under P A K r 1 

— Ox 


v? = Z?5 X , 

where C(Z K ) = P A ((1 — ux^{x))b(x), d(x),a(x, x), Zk /K). 

Therefore, by Theorem |2 (c), for K such that uk < £ and for t < t\, 

Z?’ 1 A(v?,l)AZ?’ 2 , (58) 

where C(Z K ' 1 ) = P A ((1 — e)b{x) 1 d(x),a(x, x), Zx/K) 
and C(Z K ’ 2 ) = P K (b(x), d(x), a(x, x), Zk/K). 

Now, let cj)y, resp. </> 2 , be the solution to 

</> = ((1 — e)b{x ) — d{x) — a(x , a:)</>)</>, 
resp. cf) = (6(x) — d{x) — a(x , a;)</>)/>, 


with initial state y, and observe that, for any y > 0, when t —► +00, <f>y(t) —> e 1 := 
fi x — eb(x)/a(x,x) and <py(t) — > e 2 := n x . 

Define, for any y > 0, t l E ,y the first time such that Vs > t z E ’ y , (j) l y {s) G [e l — £, e l +e] 
(i = 1, 2). Because of the continuity of the flows of these ODEs, 

t l E \= sup t l E v < +00. 

y£[z/2,1z\ 


Let us apply Theorem|31(a) to Z K ' 1 and Z K ' 2 on [0, t e \, where t E = Vt 2 : since 
Zk/K —> z, for sufficiently small 6 > 0, and for * = 1,2, 

lim p(" sup \Z*’ 1 - <f>l K / K (t)\ > s) = 0. 
k^+oo Vo <t<t s ) 
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If we choose S < e, we obtain, for i = 1,2, 


lim P(| Z?* - e i \ < 2e) = 1, 

K —»+oo 

and so, for i = 1,2, 

lim P(| Z^' 1 —n x | < Me) = 1, (59) 

K —»+oo e 

where M = 2 + b{x)/a{x , x). 

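Now, assuming $\varepsilon$ sufficiently small for $(M+1)\varepsilon < \bar n_x$, define the stopping times

$$T^{K,i}_\varepsilon = \inf\{t > t_\varepsilon : |Z^{K,i}_t - \bar n_x| > (M+1)\varepsilon\}$$

for $i = 1, 2$, and $T^K_\varepsilon = T^{K,1}_\varepsilon \wedge T^{K,2}_\varepsilon$.

For any $z \in \mathbb{N}/K$, define also

$$\mathbf{P}^{K,1}_z := \mathbf{P}^K((1-\varepsilon)b(x), d(x), a(x,x), z).$$

Then, applying Theorem 3 (c) to $\mathbf{P}^{K,1}_z$ with $C = [\bar n_x - M\varepsilon, \bar n_x + M\varepsilon]$, there exists $V_1 > 0$ such that

$$\lim_{K \to +\infty} \inf_{z \in C} \mathbf{P}^{K,1}_z(T_\varepsilon > e^{KV_1}) = 1, \qquad (60)$$

where $T_\varepsilon = \inf\{t > 0 : |w_t - \bar n_x| > (M+1)\varepsilon\}$.

Therefore, applying the Markov property at time $t_\varepsilon$, it follows from (59) that

$$\lim_{K \to +\infty} P(T^{K,1}_\varepsilon > e^{KV_1} + t_\varepsilon) = 1.$$

Similarly, there exists $V_2 > 0$ such that

$$\lim_{K \to +\infty} P(T^{K,2}_\varepsilon > e^{KV_2} + t_\varepsilon) = 1,$$

and thus

$$\lim_{K \to +\infty} P(T^K_\varepsilon > e^{KV}) = 1, \qquad (61)$$

where $V := V_1 \wedge V_2$.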

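Now, because of (58),

$$\forall t \in [t_\varepsilon, T^K_\varepsilon \wedge \tau_1], \quad |\langle \nu^K_t, 1 \rangle - \bar n_x| < (M+1)\varepsilon. \qquad (62)$$

Therefore, since $\log K > t_\varepsilon$ for sufficiently large $K$, in order to complete the proof of the desired estimate, it suffices to show that

$$\lim_{K \to +\infty} P(\tau_1 < T^K_\varepsilon) = 1. \qquad (63)$$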


If we denote by $A^K_t$ the number of mutations occurring between $t_\varepsilon$ and $t + t_\varepsilon$, by Theorem 2 (b), for $t$ such that $t_\varepsilon + t < T^K_\varepsilon \wedge \tau_1$,

$$B^K_t \preceq A^K_t,$$

where $B^K$ is a Poisson process with parameter $Ku_K(\bar n_x - (M+1)\varepsilon)\mu(x)b(x)$.

Therefore, if we denote by $S^K$ the first time when $B^K_t = 1$, on the event $\{t_\varepsilon + S^K < T^K_\varepsilon\}$,

$$\tau_1 \leq t_\varepsilon + S^K.$$

Since $\exp(-KV) \ll Ku_K$, $\lim_{K \to +\infty} P(t_\varepsilon + S^K < e^{KV}) = 1$, and hence, by (61),

$$\lim_{K \to +\infty} P(t_\varepsilon + S^K < T^K_\varepsilon) = 1,$$

which implies (63).
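The limit involving the first jump time of the Poisson process can be made explicit. The following worked step is a sketch in which the shorthand $\lambda_K$ and $c$ is introduced only for this computation, and it assumes $\mu(x)b(x) > 0$, so that $c > 0$ when $(M+1)\varepsilon < \bar n_x$.

```latex
% First jump time of a Poisson process with parameter
% \lambda_K := K u_K c, where c := (\bar n_x - (M+1)\varepsilon)\mu(x)b(x) > 0.
\[
  P\bigl(t_\varepsilon + S^K \geq e^{KV}\bigr)
  = \exp\bigl(-\lambda_K (e^{KV} - t_\varepsilon)\bigr)
  = \exp\bigl(-c\,K u_K e^{KV} + c\,K u_K t_\varepsilon\bigr)
  \quad \text{as soon as } e^{KV} \geq t_\varepsilon .
\]
% Since \exp(-KV) \ll K u_K, the product K u_K e^{KV} tends to +\infty,
% while K u_K t_\varepsilon stays bounded (K u_K \to 0 because \log K \ll 1/(K u_K)).
\[
  \lim_{K \to +\infty} P\bigl(t_\varepsilon + S^K \geq e^{KV}\bigr) = 0 .
\]
```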

In the case where $z_K/K \to \bar n_x$, using (60) as above, we obtain easily

$$\lim_{K \to +\infty} P(S^K_\varepsilon > e^{KV}) = 1,$$

where $S^K_\varepsilon = \inf\{t > 0 : |Z^{K,i}_t - \bar n_x| > (M+1)\varepsilon,\ i = 1, 2\}$.

Then, the proof of the corresponding estimate can be completed using the same method as the one we used above.


□ 


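Proof of Lemma 2 (c). Fix $t > 0$ and $\varepsilon > 0$. Take $K$ large enough for $\log K < t/(Ku_K)$. The Markov property at time $\log K$ for $\nu^K$ yields

$$\mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big(\tau_1 > \frac{t}{Ku_K},\ \sup_{s \in [\log K, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\Big) = \mathbf{E}^K_{\frac{z_K}{K}\delta_x}\Big[\mathbf{1}_{\{\tau_1 > \log K\}}\, \mathbf{P}^K_{\nu^K_{\log K}}\Big(\tau_1 > \frac{t}{Ku_K} - \log K,\ \sup_{s \in [0, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\Big)\Big]. \qquad (64)$$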


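For any initial condition $\nu^K_0 = \langle \nu^K_0, 1 \rangle \delta_x$ of $\nu^K$, by Theorem 2 (b), the number $A^K_t$ of mutations of $\nu^K$ between $0$ and $t$ satisfies, for any $t < \tau_1$ such that $\sup_{s \in [0,t]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon$,

$$B^K_t \preceq A^K_t \preceq C^K_t,$$

where $B^K$ and $C^K$ are Poisson processes with respective parameters $Ku_K(\bar n_x - \varepsilon)\mu(x)b(x)$ and $Ku_K(\bar n_x + \varepsilon)\mu(x)b(x)$.

Therefore, on the event $\{\sup_{s \in [0, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\}$, $S^K \leq \tau_1 \leq T^K$, where $T^K$ is the first time when $B^K_t = 1$, and $S^K$ the first time when $C^K_t = 1$.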

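Now, by Lemma 2 (b), under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$, $\langle \nu^K_{\log K}, 1 \rangle \to \bar n_x$ in probability, so, by Skorohod's Theorem, we can construct $N^K$ with the same law as $\langle \nu^K_{\log K}, 1 \rangle$ on an auxiliary probability space $\hat\Omega$ such that $N^K(\omega) \to \bar n_x$ for any $\omega \in \hat\Omega$. Fix $\omega \in \hat\Omega$. Then, by Lemma 2 (b),

$$\lim_{K \to +\infty} \mathbf{P}^K_{N^K(\omega)\delta_x}\Big(\sup_{s \in [0, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\Big) = 1,$$

and so,

$$\limsup_{K \to +\infty} \mathbf{P}^K_{N^K(\omega)\delta_x}\Big(\tau_1 > \frac{t}{Ku_K} - \log K,\ \sup_{s \in [0, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\Big) \leq \exp(-t(\bar n_x - \varepsilon)\mu(x)b(x)).$$

Therefore, under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$,

$$\limsup_{K \to +\infty} \mathbf{P}^K_{\nu^K_{\log K}}\Big(\tau_1 > \frac{t}{Ku_K} - \log K,\ \sup_{s \in [0, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\Big) \leq \exp(-t(\bar n_x - \varepsilon)\mu(x)b(x))$$

in probability (where $\limsup X_n \leq a$ in probability means that, for any $\eta > 0$, $P(X_n > a + \eta) \to 0$).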

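Similarly, under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x}$,

$$\liminf_{K \to +\infty} \mathbf{P}^K_{\nu^K_{\log K}}\Big(\tau_1 > \frac{t}{Ku_K} - \log K,\ \sup_{s \in [0, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\Big) \geq \exp(-t(\bar n_x + \varepsilon)\mu(x)b(x))$$

in probability.

Now, by Lemma 2 (a) and (b),

$$\lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}(\tau_1 > \log K) = 1$$

$$\text{and}\quad \lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big(\sup_{s \in [\log K, \tau_1]} |\langle \nu^K_s, 1 \rangle - \bar n_x| < \varepsilon\Big) = 1.$$

So, using these two limits, it follows from (64) that

$$\limsup_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big(\tau_1 > \frac{t}{Ku_K}\Big) \leq \exp(-t(\bar n_x - \varepsilon)\mu(x)b(x))$$

$$\text{and}\quad \liminf_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Big(\tau_1 > \frac{t}{Ku_K}\Big) \geq \exp(-t(\bar n_x + \varepsilon)\mu(x)b(x)).$$

Since this holds for any $\varepsilon > 0$, we have completed the proof of Lemma 2 (c). □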
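The last step can be spelled out as follows. This is a sketch under the assumption that the claim of part (c) is the exponential limit obtained by letting $\varepsilon$ tend to $0$ in the two bounds; the subscript on the probability denotes the initial condition with $z_K$ individuals of trait $x$, each of mass $1/K$.

```latex
% Sandwich obtained from the limsup and liminf bounds, valid for every \varepsilon > 0:
\[
  e^{-t(\bar n_x + \varepsilon)\mu(x)b(x)}
  \;\leq\; \liminf_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Bigl(\tau_1 > \tfrac{t}{K u_K}\Bigr)
  \;\leq\; \limsup_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Bigl(\tau_1 > \tfrac{t}{K u_K}\Bigr)
  \;\leq\; e^{-t(\bar n_x - \varepsilon)\mu(x)b(x)} .
\]
% Letting \varepsilon \to 0, both outer terms converge to the same value:
\[
  \lim_{K \to +\infty} \mathbf{P}^K_{\frac{z_K}{K}\delta_x}\Bigl(\tau_1 > \tfrac{t}{K u_K}\Bigr)
  = \exp\bigl(-t\,\bar n_x\,\mu(x)\,b(x)\bigr),
\]
% that is, K u_K \tau_1 is asymptotically exponential with parameter \bar n_x \mu(x) b(x).
```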


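Proof of Lemma 3. The proof of this lemma follows the three steps of the invasion of a mutant described earlier (cf. the accompanying figure).

Fix $\eta > 0$, $\varepsilon_0 > 0$ and $0 < \varepsilon < \varepsilon_0$. By Lemma 2 (a), there exists a constant $\rho > 0$ that we can assume smaller than $\eta$, such that, for sufficiently large $K$,

$$\mathbf{P}^K_{\frac{z_K}{K}\delta_x + \frac{1}{K}\delta_y}\Big(\tau_1 < \frac{\rho}{Ku_K}\Big) < \varepsilon. \qquad (65)$$

Observe that, under $\mathbf{P}^K_{\frac{z_K}{K}\delta_x + \frac{1}{K}\delta_y}$, for $t < \tau_1$,

$$\mathcal{L}((\langle \nu^K, \mathbf{1}_{\{x\}} \rangle, \langle \nu^K, \mathbf{1}_{\{y\}} \rangle)) = \mathbf{Q}^K((1 - u_K\mu(x))b(x), (1 - u_K\mu(y))b(y), d(x), d(y), a(x,x), a(x,y), a(y,x), a(y,y), z_K/K, 1/K).$$

Fix $K$ large enough for $u_K < \varepsilon$. Define

$$S^K_\varepsilon := \inf\{s > 0 : \langle \nu^K_s, \mathbf{1}_{\{y\}} \rangle > \varepsilon\}.$$

By Theorem 2 (c) and (d), for $t < \tau_1 \wedge S^K_\varepsilon$,

$$Z^{K,1}_t \preceq \langle \nu^K_t, \mathbf{1}_{\{x\}} \rangle \preceq Z^{K,2}_t, \qquad (66)$$

where $\mathcal{L}(Z^{K,1}) = \mathbf{P}^K((1-\varepsilon)b(x), d(x) + \varepsilon a(x,y), a(x,x), z_K/K)$
and $\mathcal{L}(Z^{K,2}) = \mathbf{P}^K(b(x), d(x), a(x,x), z_K/K)$.

Using the method that led us to (61), we can deduce from Theorem 3 (c) that there exists $V > 0$ such that

$$\lim_{K \to +\infty} P(R^K_\varepsilon > e^{KV}) = 1, \qquad (67)$$

where $R^K_\varepsilon = \inf\{t > 0 : |Z^{K,i}_t - \bar n_x| > M\varepsilon,\ i = 1, 2\}$,
with $M = 3 + (b(x) + a(x,y))/a(x,x)$.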

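Now, observe that, by (66),

$$\forall t < \tau_1 \wedge S^K_\varepsilon \wedge R^K_\varepsilon, \quad \langle \nu^K_t, \mathbf{1}_{\{x\}} \rangle \in [\bar n_x - M\varepsilon, \bar n_x + M\varepsilon].$$

Therefore, by Theorem 2 (c) and (e), for $t < \tau_1 \wedge S^K_\varepsilon \wedge R^K_\varepsilon$,

$$Z^{K,3}_t \preceq \langle \nu^K_t, \mathbf{1}_{\{y\}} \rangle \preceq Z^{K,4}_t, \qquad (68)$$

where $\mathcal{L}(Z^{K,3}) = \mathbf{P}^K((1-\varepsilon)b(y), d(y) + (\bar n_x + M\varepsilon)a(y,x) + \varepsilon a(y,y), 0, 1/K)$
and $\mathcal{L}(Z^{K,4}) = \mathbf{P}^K(b(y), d(y) + (\bar n_x - M\varepsilon)a(y,x), 0, 1/K)$.

Define, for any $K \geq 1$, $n \in \mathbb{N}$ and $i \in \{3, 4\}$, the stopping time

$$T^{K,i}_{n/K} = \inf\{t > 0 : Z^{K,i}_t = n/K\}.$$

Observe that, if $S^K_\varepsilon < \tau_1 \wedge R^K_\varepsilon$,

$$T^{K,4}_{\lceil \varepsilon K \rceil / K} \leq S^K_\varepsilon \leq T^{K,3}_{\lceil \varepsilon K \rceil / K} \qquad (69)$$

and that, if $T^{K,4}_0 < T^{K,4}_{\lceil \varepsilon K \rceil / K} \wedge \tau_1 \wedge R^K_\varepsilon$,

$$\theta_0 \leq T^{K,4}_0.$$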

If $Z^{K,4}$ is sub-critical, apply Theorem 4 (25), and if $Z^{K,4}$ is super-critical, apply Theorem 4 (28) (the critical case can be excluded by slightly changing the value of $\varepsilon$). Since $\log K \ll 1/(Ku_K)$, we obtain

$$\lim_{K \to +\infty} P\Big(T^{K,4}_0 < \frac{\rho}{Ku_K} \wedge T^{K,4}_{\lceil \varepsilon K \rceil / K}\Big) \geq \frac{d(y) + (\bar n_x - M\varepsilon)a(y,x)}{b(y)} \wedge 1 \geq 1 - \frac{[f(y,x)]_+}{b(y)} - M\varepsilon\frac{a(y,x)}{b(y)}. \qquad (70)$$
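The lower bound in (70) rests on the extinction probability of a linear birth-death branching process started from one individual, which is the minimum of 1 and the ratio of the death rate to the birth rate. The sketch below checks the resulting inequality on two hypothetical parameter sets; the function names and numerical values are assumptions made for the example, and the fitness is taken to be f(y,x) = b(y) - d(y) - a(y,x) times the equilibrium density of x.

```python
def extinction_probability(birth, death):
    """Extinction probability of a linear birth-death process started from one individual."""
    return min(1.0, death / birth)


def lower_bound(b_y, d_y, a_yx, n_bar_x, M, eps):
    """Right-hand side 1 - [f]_+ / b(y) - M * eps * a(y,x) / b(y) of the bound."""
    f = b_y - d_y - a_yx * n_bar_x
    return 1.0 - max(f, 0.0) / b_y - M * eps * a_yx / b_y


# Hypothetical parameter sets (b_y, d_y, a_yx, n_bar_x).
cases = [
    (3.0, 1.0, 1.0, 1.0),  # f(y,x) = 1 > 0: super-critical upper process
    (1.5, 1.0, 1.0, 1.0),  # f(y,x) = -0.5 < 0: sub-critical upper process
]
M, eps = 5.0, 0.01
results = []
for b_y, d_y, a_yx, n_bar_x in cases:
    # Death rate of the dominating process: d(y) + (n_bar_x - M * eps) * a(y,x).
    q = extinction_probability(b_y, d_y + (n_bar_x - M * eps) * a_yx)
    lb = lower_bound(b_y, d_y, a_yx, n_bar_x, M, eps)
    results.append((q, lb))
    print(q, lb)
```

In the super-critical case the two quantities coincide, and in the sub-critical case the extinction probability equals 1 and the bound is strict.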


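Combining (65), (67), (69) and (70), and using the facts that $\rho < \eta$, $\varepsilon < \varepsilon_0$ and $\exp(KV) > \rho/(Ku_K)$ for sufficiently large $K$, we obtain, taking $K$ larger if necessary,

$$\begin{aligned}
&P\Big(\theta_0 < \tau_1 \wedge \frac{\eta}{Ku_K},\ V_0 = x \text{ and } |\langle \nu^K_{\theta_0}, 1 \rangle - \bar n_x| < M\varepsilon_0\Big) \\
&\qquad \geq P\Big(\theta_0 < \tau_1 \wedge S^K_\varepsilon \wedge R^K_\varepsilon \wedge \frac{\rho}{Ku_K} \text{ and } V_0 = x\Big) \\
&\qquad \geq P\Big(T^{K,4}_0 < \tau_1 \wedge T^{K,4}_{\lceil \varepsilon K \rceil / K} \wedge R^K_\varepsilon \wedge \frac{\rho}{Ku_K}\Big) \\
&\qquad \geq 1 - \frac{[f(y,x)]_+}{b(y)} - \Big(\frac{a(y,x)}{b(y)} M + 3\Big)\varepsilon. \qquad (71)
\end{aligned}$$

This ends the proof of Lemma 3 in the case where $f(y,x) < 0$.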

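Let us assume that $f(y,x) > 0$, i.e. that $b(y) - d(y) - \bar n_x a(y,x) > 0$. If we choose $\varepsilon > 0$ sufficiently small, then $Z^{K,3}$ is super-critical. By Theorem 4 (29),

$$\lim_{K \to +\infty} P\Big(T^{K,3}_{\lceil \varepsilon K \rceil / K} < \frac{\rho}{3Ku_K}\Big) \geq \frac{(1-\varepsilon)b(y) - d(y) - (\bar n_x + M\varepsilon)a(y,x) - \varepsilon a(y,y)}{(1-\varepsilon)b(y)} = \frac{f(y,x)}{(1-\varepsilon)b(y)} - \varepsilon\,\frac{b(y) + Ma(y,x) + a(y,y)}{(1-\varepsilon)b(y)}.$$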
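The equality between the two fractions is a regrouping of the numerator. A short worked step, using only the definition $f(y,x) = b(y) - d(y) - \bar n_x a(y,x)$ recalled above:

```latex
\begin{align*}
  &(1-\varepsilon)b(y) - d(y) - (\bar n_x + M\varepsilon)\,a(y,x) - \varepsilon\,a(y,y) \\
  &\qquad = \bigl(b(y) - d(y) - \bar n_x\,a(y,x)\bigr)
     - \varepsilon\bigl(b(y) + M\,a(y,x) + a(y,y)\bigr) \\
  &\qquad = f(y,x) - \varepsilon\bigl(b(y) + M\,a(y,x) + a(y,y)\bigr).
\end{align*}
% Dividing by (1-\varepsilon) b(y) gives the right-hand side. For \varepsilon < 1/2,
% 1/(1-\varepsilon) < 2, so the error term is at most
% 2\varepsilon (b(y) + M a(y,x) + a(y,y)) / b(y).
```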
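Therefore, by the preceding estimates and assuming (without loss of generality) that $\varepsilon < 1/2$, for sufficiently large $K$,

$$P\Big(T^{K,3}_{\lceil \varepsilon K \rceil / K} < \tau_1 \wedge R^K_\varepsilon \wedge \frac{\rho}{3Ku_K}\Big) \geq \frac{f(y,x)}{(1-\varepsilon)b(y)} - M'\varepsilon,$$

where $M' := 2(b(y) + Ma(y,x) + a(y,y))/b(y) + 3$. Then, it follows from (69) that

$$P\Big(S^K_\varepsilon < \tau_1 \wedge R^K_\varepsilon \wedge \frac{\rho}{3Ku_K}\Big) \geq \frac{f(y,x)}{(1-\varepsilon)b(y)} - M'\varepsilon. \qquad (72)$$

Observe that, on the event $\{S^K_\varepsilon < \tau_1 \wedge R^K_\varepsilon \wedge (\rho/3Ku_K)\}$,

$$\langle \nu^K_{S^K_\varepsilon}, \mathbf{1}_{\{y\}} \rangle = \lceil \varepsilon K \rceil / K \quad \text{and} \quad |\langle \nu^K_{S^K_\varepsilon}, \mathbf{1}_{\{x\}} \rangle - \bar n_x| < M\varepsilon. \qquad (73)$$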

Now, since we have assumed $f(y,x) > 0$, $x$ and $y$ satisfy the assumption under which, by the corresponding proposition, any solution to the two-trait deterministic system with initial state in the compact set $[\bar n_x - M\varepsilon, \bar n_x + M\varepsilon] \times [\varepsilon/2, 2\varepsilon]$ converges to $(0, \bar n_y)$ when $t \to +\infty$. As in the proof of Lemma 2 (b), because of the continuity of the flow of this system, we can find $t_\varepsilon < +\infty$ large enough such that none of these solutions leaves the set $[0, \varepsilon^2/2] \times [\bar n_y - \varepsilon/2, \bar n_y + \varepsilon/2]$ after time $t_\varepsilon$.
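The convergence to the state where only trait y survives can be visualised on an example. The sketch below integrates a two-trait competitive Lotka-Volterra system with an explicit Euler scheme; the form of the system (per capita growth rate b - d minus the competition terms), the function name `integrate_dimorphic` and the parameter values are assumptions made for the illustration, chosen so that f(y,x) > 0 and f(x,y) < 0.

```python
def integrate_dimorphic(n_x, n_y, b_x, d_x, b_y, d_y,
                        a_xx, a_xy, a_yx, a_yy, t_max, dt=1e-3):
    """Explicit Euler integration of an assumed two-trait Lotka-Volterra system:

        n_x' = n_x * (b_x - d_x - a_xx * n_x - a_xy * n_y)
        n_y' = n_y * (b_y - d_y - a_yx * n_x - a_yy * n_y)
    """
    steps = int(round(t_max / dt))
    for _ in range(steps):
        dx = n_x * (b_x - d_x - a_xx * n_x - a_xy * n_y)
        dy = n_y * (b_y - d_y - a_yx * n_x - a_yy * n_y)
        n_x += dt * dx
        n_y += dt * dy
    return n_x, n_y


# Hypothetical parameters: equilibrium densities 1 for x and 2 for y.
b_x, d_x, b_y, d_y = 2.0, 1.0, 3.0, 1.0
a_xx = a_xy = a_yx = a_yy = 1.0
n_bar_x = (b_x - d_x) / a_xx
n_bar_y = (b_y - d_y) / a_yy
f_yx = b_y - d_y - a_yx * n_bar_x   # invasion fitness of y against resident x
f_xy = b_x - d_x - a_xy * n_bar_y   # invasion fitness of x against resident y

# Start near (n_bar_x, eps): resident x at equilibrium, mutant y at small density.
final_x, final_y = integrate_dimorphic(1.0, 0.05, b_x, d_x, b_y, d_y,
                                       a_xx, a_xy, a_yx, a_yy, t_max=40.0)
print(f_yx, f_xy, final_x, final_y)
```

With these values the mutant density first grows while the resident stays near its equilibrium, then the resident density decays to 0 and the mutant settles at its own equilibrium.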

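Apply Theorem 3 (b) on $[0, t_\varepsilon]$, with $C = [\bar n_x - M\varepsilon, \bar n_x + M\varepsilon] \times [\varepsilon/2, 2\varepsilon]$ and with a constant $\delta < \varepsilon^2/2 \wedge r$, where $r$ is the corresponding constant of that theorem (with $T = t_\varepsilon$). Then, with the notations of Theorem 3 (b), because of (72) and (73), the Markov property at time $S^K_\varepsilon$ yields

$$\begin{aligned}
&\liminf_{K \to +\infty} P\Big(S^K_\varepsilon < \tau_1 \wedge R^K_\varepsilon \wedge \frac{\rho}{3Ku_K}, \\
&\qquad \sup_{S^K_\varepsilon \leq s \leq S^K_\varepsilon + t_\varepsilon} \big\|(\langle \nu^K_s, \mathbf{1}_{\{x\}} \rangle, \langle \nu^K_s, \mathbf{1}_{\{y\}} \rangle) - \phi_{(\langle \nu^K_{S^K_\varepsilon}, \mathbf{1}_{\{x\}} \rangle, \langle \nu^K_{S^K_\varepsilon}, \mathbf{1}_{\{y\}} \rangle)}(s - S^K_\varepsilon)\big\| < \delta\Big) \\
&\qquad \geq \frac{f(y,x)}{(1-\varepsilon)b(y)} - M'\varepsilon. \qquad (74)
\end{aligned}$$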


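Now, observe that, since $\delta < r$, on the event

$$\Big\{S^K_\varepsilon < \tau_1 \wedge R^K_\varepsilon,\ \sup_{S^K_\varepsilon \leq s \leq S^K_\varepsilon + t_\varepsilon} \big\|(\langle \nu^K_s, \mathbf{1}_{\{x\}} \rangle, \langle \nu^K_s, \mathbf{1}_{\{y\}} \rangle) - \phi_{(\langle \nu^K_{S^K_\varepsilon}, \mathbf{1}_{\{x\}} \rangle, \langle \nu^K_{S^K_\varepsilon}, \mathbf{1}_{\{y\}} \rangle)}(s - S^K_\varepsilon)\big\| < \delta\Big\},$$

for any $t \in [S^K_\varepsilon, S^K_\varepsilon + t_\varepsilon]$, $\langle \nu^K_t, \mathbf{1}_{\{x\}} \rangle > r - \delta > 0$ and $\langle \nu^K_t, \mathbf{1}_{\{y\}} \rangle > r - \delta > 0$, and thus

$$\theta_0 > S^K_\varepsilon + t_\varepsilon.$$

Therefore, since $\delta < \varepsilon^2/2 < \varepsilon/2$, by (65) and (74), for sufficiently large $K$,

$$\begin{aligned}
&P\Big(S^K_\varepsilon < R^K_\varepsilon \wedge \frac{\rho}{3Ku_K},\ \tau_1 > \frac{2\rho}{3Ku_K} + t_\varepsilon,\ \theta_0 > S^K_\varepsilon + t_\varepsilon, \\
&\qquad \langle \nu^K_{S^K_\varepsilon + t_\varepsilon}, \mathbf{1}_{\{x\}} \rangle < \varepsilon^2 \text{ and } \langle \nu^K_{S^K_\varepsilon + t_\varepsilon}, \mathbf{1}_{\{y\}} \rangle \in [\bar n_y - \varepsilon, \bar n_y + \varepsilon]\Big) \\
&\qquad \geq \frac{f(y,x)}{(1-\varepsilon)b(y)} - (M' + 2)\varepsilon. \qquad (75)
\end{aligned}$$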


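Now, we will compare $\langle \nu^K, \mathbf{1}_{\{x\}} \rangle$ with a branching process after time $S^K_\varepsilon + t_\varepsilon$ in order to prove that trait $x$ gets extinct with a very high probability. We will use a method very similar to the one we used in the beginning of this proof. First, on the event inside the probability in (75), $\langle \nu^K_{S^K_\varepsilon + t_\varepsilon}, \mathbf{1}_{\{x\}} \rangle < \varepsilon^2$. In order to prove that the population with trait $x$ stays small after $S^K_\varepsilon + t_\varepsilon$, let us define the stopping time

$$\hat S^K_\varepsilon = \inf\{t > S^K_\varepsilon + t_\varepsilon : \langle \nu^K_t, \mathbf{1}_{\{x\}} \rangle > \varepsilon\}$$

(recall that $\varepsilon^2 < \varepsilon$ since $\varepsilon < 1/2$). Using Theorem 2 (c) and (d) again, we see that, on the event

$$F^{K,\varepsilon} := \{\langle \nu^K_{S^K_\varepsilon + t_\varepsilon}, \mathbf{1}_{\{x\}} \rangle < \varepsilon^2,\ \langle \nu^K_{S^K_\varepsilon + t_\varepsilon}, \mathbf{1}_{\{y\}} \rangle \in [\bar n_y - \varepsilon, \bar n_y + \varepsilon]\},$$

for any $t > 0$ such that $S^K_\varepsilon + t_\varepsilon + t < \hat S^K_\varepsilon \wedge \tau_1$,

$$Z^{K,5}_t \preceq \langle \nu^K_{S^K_\varepsilon + t_\varepsilon + t}, \mathbf{1}_{\{y\}} \rangle \preceq Z^{K,6}_t,$$

where $\mathcal{L}(Z^{K,5}) = \mathbf{P}^K((1-\varepsilon)b(y), d(y) + \varepsilon a(y,x), a(y,y), \lfloor (\bar n_y - \varepsilon)K \rfloor / K)$
and $\mathcal{L}(Z^{K,6}) = \mathbf{P}^K(b(y), d(y), a(y,y), \lceil (\bar n_y + \varepsilon)K \rceil / K)$.

We can apply Theorem 3 (c) to $Z^{K,5}$ and $Z^{K,6}$ as above to obtain a constant $V' > 0$ such that

$$\lim_{K \to +\infty} P(\hat R^K_\varepsilon > e^{KV'}) = 1, \qquad (76)$$

where $\hat R^K_\varepsilon = \inf\{t > 0 : |Z^{K,i}_t - \bar n_y| > M''\varepsilon,\ i = 5, 6\}$,

with $M'' = 3 + (b(y) + a(y,x))/a(y,y)$.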

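Observe that, on the event $F^{K,\varepsilon}$, for any $t < \hat R^K_\varepsilon$ such that $S^K_\varepsilon + t_\varepsilon + t < \hat S^K_\varepsilon \wedge \tau_1$,

$$|\langle \nu^K_{S^K_\varepsilon + t_\varepsilon + t}, \mathbf{1}_{\{y\}} \rangle - \bar n_y| \leq M''\varepsilon,$$

and so, by Theorem 2 (c) and (e), on $F^{K,\varepsilon}$ and for $t$ as above,

$$\langle \nu^K_{S^K_\varepsilon + t_\varepsilon + t}, \mathbf{1}_{\{x\}} \rangle \preceq Z^{K,7}_t,$$

where $\mathcal{L}(Z^{K,7}) = \mathbf{P}^K(b(x), d(x) + (\bar n_y - M''\varepsilon)a(x,y), 0, \lceil \varepsilon^2 K \rceil / K)$.

Now, by the assumption on $x$ and $y$ used above, $Z^{K,7}$ is sub-critical for sufficiently small $\varepsilon$. Fix such an $\varepsilon > 0$ and define for any $n \geq 0$

$$\hat T^K_{n/K} = \inf\{t > 0 : Z^{K,7}_t = n/K\}.$$

If $\hat T^K_{\lceil \varepsilon K \rceil / K} \leq \hat R^K_\varepsilon$ and $S^K_\varepsilon + t_\varepsilon + \hat T^K_{\lceil \varepsilon K \rceil / K} \leq \tau_1$, then

$$\hat S^K_\varepsilon \geq S^K_\varepsilon + t_\varepsilon + \hat T^K_{\lceil \varepsilon K \rceil / K},$$

and if $\hat T^K_0 < \hat T^K_{\lceil \varepsilon K \rceil / K} \wedge \hat R^K_\varepsilon$ and $S^K_\varepsilon + t_\varepsilon + \hat T^K_0 < \hat S^K_\varepsilon \wedge \tau_1$, then

$$\theta_0 \leq S^K_\varepsilon + t_\varepsilon + \hat T^K_0.$$

Moreover, by Theorem 4, for sufficiently large $K$,

$$P\Big(\hat T^K_0 < \frac{\rho}{3Ku_K}\Big) \geq 1 - \varepsilon$$

$$\text{and}\quad P\big(\hat T^K_{\lceil \varepsilon K \rceil / K} < \hat T^K_0\big) \leq 2\varepsilon.$$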
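Combining the last two inequalities with (75) and (76), and recalling that $\rho < \eta$ and $\varepsilon < \varepsilon_0$, we finally obtain, for sufficiently large $K$,

$$\begin{aligned}
&P\Big(\theta_0 < \tau_1 \wedge \frac{\eta}{Ku_K},\ V_0 = y \text{ and } |\langle \nu^K_{\theta_0}, 1 \rangle - \bar n_y| < M''\varepsilon_0\Big) \\
&\qquad \geq P\Big(S^K_\varepsilon < R^K_\varepsilon \wedge \frac{\rho}{3Ku_K},\ \theta_0 > S^K_\varepsilon + t_\varepsilon,\ \tau_1 > \frac{2\rho}{3Ku_K} + t_\varepsilon, \\
&\qquad\qquad \langle \nu^K_{S^K_\varepsilon + t_\varepsilon}, \mathbf{1}_{\{y\}} \rangle \in [\bar n_y - \varepsilon, \bar n_y + \varepsilon],\ \langle \nu^K_{S^K_\varepsilon + t_\varepsilon}, \mathbf{1}_{\{x\}} \rangle < \varepsilon^2, \\
&\qquad\qquad \hat T^K_0 < \frac{\rho}{3Ku_K} \wedge \hat T^K_{\lceil \varepsilon K \rceil / K} \text{ and } \hat R^K_\varepsilon > \frac{\rho}{Ku_K}\Big) \\
&\qquad \geq \frac{f(y,x)}{(1-\varepsilon)b(y)} - (M' + 7)\varepsilon.
\end{aligned}$$

Adding this inequality with (71), we obtain

$$P\Big(\theta_0 < \tau_1 \wedge \frac{\eta}{Ku_K}\Big) \geq 1 - \frac{\varepsilon}{1-\varepsilon}\,\frac{f(y,x)}{b(y)} - \Big(M\frac{a(y,x)}{b(y)} + M' + 10\Big)\varepsilon \geq 1 - M'''\varepsilon,$$

where $M''' = 2f(y,x)/b(y) + Ma(y,x)/b(y) + M' + 10$, which implies the first estimate of the lemma, and

$$P(|\langle \nu^K_{\theta_0}, 1 \rangle - \bar n_{V_0}| < (M \vee M'')\varepsilon_0) \geq 1 - M'''\varepsilon,$$

which implies the corresponding estimate on the population size at time $\theta_0$.

Therefore,

$$P(V_0 = x) \geq 1 - \frac{f(y,x)}{b(y)} - 2M'''\varepsilon \quad \text{and} \quad P(V_0 = y) \geq \frac{f(y,x)}{(1-\varepsilon)b(y)} - 2M'''\varepsilon.$$

Since $P(V_0 = x) \leq 1 - P(V_0 = y)$, we finally obtain the two remaining estimates of Lemma 3.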


□ 